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We consider a connection-level model of Internet congestion con- 
trol, introduced by Massoulie and Roberts [Telecommunication Sys- 
tems 15 (2000) 185-201], that represents the randomly varying num- 
ber of flows present in a network. Here, bandwidth is shared fairly 
among elastic document transfers according to a weighted a-fair 
bandwidth sharing policy introduced by Mo and Walrand [IEEE/ACM 
Transactions on Networking 8 (2000) 556-567] [a £ (0, oo)]. Assum- 
ing Poisson arrivals and exponentially distributed document sizes, we 
focus on the heavy traffic regime in which the average load placed on 
each resource is approximately equal to its capacity. A fluid model 
(or functional law of large numbers approximation) for this stochastic 
model was derived and analyzed in a prior work [Ann. Appl. Probab. 
14 (2004) 1055-1083] by two of the authors. Here, we use the long- 
time behavior of the solutions of the fluid model established in that 
paper to derive a property called multiplicative state space collapse, 
which, loosely speaking, shows that in diffusion scale, the flow count 
process for the stochastic model can be approximately recovered as 
a continuous lifting of the workload process. 

Under weighted proportional fair sharing of bandwidth (a = 1) 
and a mild local traffic condition, we show how multiplicative state 
space collapse can be combined with uniqueness in law and an in- 
variance principle for the diffusion [Theory Probab. Appl. 40 (1995) 
1-40], [Ann. Appl. Probab. 17 (2007) 741-779] to establish a diffu- 
sion approximation for the workload process and hence to yield an 
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approximation for the flow count process. In this case, the workload 
diffusion behaves like Brownian motion in the interior of a polyhedral 
cone and is confined to the cone by reflection at the boundary, where 
the direction of reflection is constant on any given boundary face. 
When all of the weights are equal (proportional fair sharing), this 
diffusion has a product form invariant measure. If the latter is inte- 
grable, it yields the unique stationary distribution for the diffusion 
which has a strikingly simple interpretation in terms of independent 
dual random variables, one for each of the resources of the network. 
We are able to extend this product form result to the case where doc- 
ument sizes are distributed as finite mixtures of exponentials and to 
models that include multi-path routing. We indicate some difficulties 
related to extending the diffusion approximation result to values of 

We illustrate our approximation results for a few simple networks. 
In particular, for a two-resource linear network, the diffusion lives in 
a wedge that is a strict subset of the positive quadrant. This geomet- 
rically illustrates the entrainment of resources, whereby congestion 
at one resource may prevent another resource from working at full 
capacity. For a four-resource network with multi-path routing, the 
product form result under proportional fair sharing is expressed in 
terms of independent dual random variables, one for each of a set of 
generalized cut constraints. 

1. Introduction. We consider a connection-level model of Internet con- 
gestion control introduced and studied by Massoulie and Roberts [33]. This 
stochastic model represents the randomly varying number of flows present in 
a network where bandwidth is dynamically shared between flows that corre- 
spond to continuous transfers of individual elastic documents. This model, 
which we shall refer to as a flow-level model, assumes a "separation of time 
scales" such that the time scale of the flow dynamics (i.e., of document ar- 
rivals and departures) is much longer than the time scale of the packet level 
dynamics on which rate control schemes such as the Transmission Control 
Protocol (TCP) converge to equilibrium. We consider the flow-level model 
operating under a family of bandwidth sharing policies introduced by Mo 
and Walrand [35], called weighted a-fair policies. Here, a is a parameter 
lying in (0,oo). The case a = 1 is called weighted proportional fair sharing 
and the case a — > oo corresponds to what is called weighted max-min fair. 

Assuming Poisson arrivals and exponentially distributed document sizes, 
de Veciana, Lee and Konstantopoulos [11] and Bonald and Massoulie [3] 
studied stability of the flow-level model operating under weighted a-fair 
bandwidth sharing policies (including limiting values of a). Lyapunov func- 
tions constructed in [11] for weighted max-min fair and proportionally fair 
policies, and in [3] for weighted a-fair policies [a G (0, oo)], imply positive 
recurrence of the Markov chain associated with the model when the aver- 
age load on each resource is less than its capacity. Lin, Shroff and Srikant 
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[30, 31, 39] have recently given sufficient conditions for stability of a Markov 
model under a back-pressure algorithm when the assumption of time scale 
separation is relaxed. For more general document size distributions, there are 
a few results for specific values of a or for specific distributions or topologies 
that provide sufficient conditions for stability of the flow-level model oper- 
ating under bandwidth sharing policies [6, 8, 28, 32]. A summary of these 
results is provided in the introduction to [14]. In general, it remains an open 
question whether, with renewal arrivals and arbitrarily (rather than expo- 
nentially) distributed document sizes, the flow-level model is stable under an 
a-fair bandwidth sharing policy [a € (0, oo)] when the nominal load placed 
on each resource is less than its capacity (see [14, 15] for some first steps in 
this direction). Here, we restrict our attention to the case of Poisson arrivals 
and exponential document sizes, for which stability is well understood. 

We are interested in using diffusion approximations to explore the perfor- 
mance of the flow-level model operating under a weighted a-fair bandwidth 
sharing policy when the average load placed on each resource is approxi- 
mately equal to its capacity, that is, the system is heavily loaded. We are 
particularly interested in manifestations of the phenomenon of entrainment, 
whereby congestion at some resources may prevent other resources from 
working at their full capacity. 

There are several motivations for our work. One source of motivation lies 
in fixed-point approximations of network performance for TCP networks 
(see [7, 13, 36]). These approximations require, as input, information on the 
joint distribution of the numbers of flows present on different routes, where 
dependencies between these numbers may be induced by the bandwidth 
sharing mechanism. Similarly, an understanding of such joint distributions 
seems important if the performance models for a single bottleneck described 
by Ben Fredj et al. [1] are to be generalized to a network. 

Another motivation is that the flow-level model typically involves the si- 
multaneous use of several resources. Due to the exponential document sizes, 
this model can be equated (in distribution) with a stochastic processing net- 
work (SPN) as introduced by Harrison [16]. Open multiclass queueing net- 
works operating under head-of-the-line (HL) service disciplines are a special 
case of SPNs without simultaneous resource possession. For certain queue- 
ing networks of this type, it has been shown [5, 42] that suitable asymptotic 
behavior of critical fluid models implies a property called multiplicative state 
space collapse, which, in turn, validates the use of Brownian model approx- 
imations for these networks in heavy traffic. For more general SPNs, in- 
vestigation of the behavior of critical fluid models, of a related notion of 
multiplicative state space collapse and of the implications for diffusion ap- 
proximations are in the early stages of development. The analysis in this 
paper can be viewed as a contribution to such an investigation for models 
involving simultaneous resource possession. For another contribution, see 
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the paper of Ye and Yao [44], who consider a stochastic processing net- 
work with simultaneous resource possession; in contrast to the fully heavily 
loaded, multiple bottleneck situation considered here, Ye and Yao consider 
the situation of a single heavily loaded bottleneck. A further recent contri- 
bution is the important paper of Shah and Wischik [38], who have proven 
multiplicative state space collapse for a class of "switched" networks with 
multiple bottlenecks operating under a family of scheduling policies related 
to the maximum weight algorithm introduced by Tassiulas and Ephremides 
[40]. 

Finally, although we restrict to exponential document sizes in this pa- 
per, we would like to relax that assumption in future work. Although this 
involves a significantly more elaborate stochastic model (see [14]) to keep 
track of residual document sizes (because of the processor sharing nature 
of the bandwidth sharing policy), knowing the results for exponential docu- 
ment sizes is likely to be useful for such work. 

1.1. Overview. In this paper, we consider the flow-level model with Pois- 
son arrivals and exponentially distributed document sizes operating under 
a weighted a-fair bandwidth sharing policy for a € (0,oo). We focus on the 
heavy traffic regime in which the average load placed on each resource is ap- 
proximately equal to its capacity. We recall the definition of a critical fluid 
model from the prior work [26] of two of the authors; this model is a formal 
functional law of large numbers approximation to the flow-level model. The 
asymptotic behavior of this critical fluid model was studied in [26]. Here, we 
show how this behavior can be used to prove a property called multiplicative 
state space collapse. Loosely speaking, this says that in diffusion scale, the 
flow count process can be approximately recovered by a continuous lifting 
of the lower-dimensional workload process. Given the asymptotic behavior 
of the critical fluid model, our proof of multiplicative state space collapse 
follows a general line of argument pioneered by Bramson in [5], where open 
multiclass queueing networks operating under certain head-of-the-line (HL) 
service disciplines are treated. There are some differences in setup and proof 
details between our treatment and Bramson's [5]. These are described in de- 
tail at the beginning of Section 6. However, we wish to emphasize that our 
main line of argument follows that of Bramson [5]. It is interesting to note 
that, in contrast to prior results on state space collapse for open multiclass 
queueing networks, our lifting map can be nonlinear (for a^l). 

The multiplicative state space collapse result leads to a natural conjec- 
ture for a diffusion approximation to the workload process. For the case of 
weighted proportional fair sharing of bandwidth (a = 1), we combine multi- 
plicative state space collapse with uniqueness in law for the diffusion [10] and 
an invariance principle [20] for semimartingale reflecting Brownian motions 
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living in domains with piecewise smooth boundaries to obtain a diffusion ap- 
proximation for the flow count process under a mild local traffic condition. 
This diffusion lives in a polyhedral cone. It behaves like Brownian motion 
in the interior of the cone and is confined to the cone by reflection (or regu- 
lation) at the boundary where the direction of reflection is constant on any 
given boundary face. We illustrate this diffusion approximation result for a 
simple two-resource linear network. Then, the diffusion lives in a wedge that 
is a strict subset of the positive quadrant. This geometrically illustrates the 
entrainment of resources, whereby congestion at one resource may prevent 
another resource from working at full capacity. We also observe how the 
wedge can vary with the weights. Ongoing work is directed toward estab- 
lishing diffusion approximations for the workload process when a^l. We 
mention some of the difficulties associated with this. These center around 
the fact that when a^l and the workload dimension is three or higher, 
although the state space for the proposed diffusion approximation for the 
workload process is a cone, it is not a polyhedral cone. Indeed, the cone 
has curved boundary faces that intersect nonsmoothly and can even meet in 
cusp-like singularities. The current lack of a general existence and unique- 
ness theory (and an associated invariance principle) for reflecting Brownian 
motions in such domains is a major obstacle to proving the conjecture for 
a 7^1. 

In the case of proportional fair sharing, that is, when a = 1 and all of 
the weights for the bandwidth sharing policy are equal, we show that the 
approximating diffusion has a product form invariant measure. When the 
latter is integrable over the state space, our results suggest a strikingly simple 
approximation for the joint stationary distribution of the number of flows 
present on different routes under proportional fair sharing and the mild local 
traffic condition. In this, each of the resources of the network has associated 
with it a dual random variable. These dual variables are independent and 
exponentially distributed, and the formal approximation to the number of 
flows on a route is proportional to the sum of the dual variables along the 
route. 

We also indicate an extension of the product form result to the situation 
where document sizes are finite mixtures of exponential distributions. Under 
this extension, the formal approximation for the joint stationary distribu- 
tion for the number of flows present on different routes is insensitive: that is, 
the approximation does not depend on the distributions of document sizes, 
other than through the means of these distributions, provided that the dis- 
tributions are finite mixtures of exponentials. This result complements the 
known result [3, 4, 33] that, for proportional fair sharing and a small class 
of topologies and parameters, the stationary distribution for the number of 
flows present on different routes is exactly insensitive: that is, the stationary 
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distribution does not depend on the distributions of document sizes, other 
than through the means of these distributions. 

Finally, we indicate a relation to more general models with routing. There 
is considerable interest in multi-path routing in the Internet and rate con- 
trol schemes generalizing TCP have been proposed [19, 25]. It is known 
that the stability region for the flow-level model may be strictly increased if 
multi-path routing is allowed [19, 27]. We show that our results on diffusion 
approximations under proportional fair sharing extend to the multi-path 
case. The local traffic condition becomes more difficult to verify in this set- 
ting, but if it is satisfied, then our results suggest a simple approximation 
for the stationary distribution of the numbers of source-destination flows in 
terms of independently distributed dual variables, one for each generalized 
cut constraint. 

A summary of some of the results of this paper (without proofs) was given 
in the conference proceedings papers [22] and [23]. 

1.2. Notation and terminology. For each positive integer d> 1, M. d will 
denote (i-dimensional Euclidean space and the positive orthant in this space 
will be denoted by = {x G M. d : Xi > for i = 1, . . . , d}. When d = 1, we 
sometimes write M instead of K 1 , and M + instead of . The Euclidean norm 
of x G K will be denoted by \x\. Vectors in M. d will be assumed to be column 
vectors unless specifically indicated otherwise. The transpose of a vector or 
matrix will be denoted by the use of a superscript ""' . Inequalities between 
vectors in M. d will be interpreted componentwise. That is, for x, y G M. d , x < y 
is equivalent to X{ < yi for i = 1, . . . , d. For each x G M. d and each set S C M. d , 
the distance between x and S is denoted by 

d(x, S) = inf{|x — y\ : y 6 S}. 

For i,i/6E, x V y = max{x,y}. For each [x\ denotes the largest 

integer less than or equal to x. Given a vector x G ]R rf , the d x d diagonal 
matrix with the entries of x on its diagonal will be denoted by diag(x). For 
positive integers d± and d2, the norm of a d\ x c?2 matrix A will be given by 

/ di d 2 \ 1/2 

u\\= EE4- • 
\i=ij=i / 

The set of nonnegative integers will be denoted by Z + and the set of points 
in with all integer coordinates will be denoted by Z^. A sum over an 
empty set of indices will be taken to have a value of zero. The cardinality 
of a finite set S will be denoted by \S\. For < s < t < oo, any integer d > 1 
and any bounded function x : [s,t] — > M. d , let ||x(-)||[ S)t ] = sup ug [ s t j and 
when s = 0, let ||a;(-)|| t = ||z(-)||[o,t]- 
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All stochastic processes in this paper will be assumed to have sample 
paths that are right-continuous with finite left limits (r.c.1.1.). We denote 
by O([0, oo),R d ) the space of r.c.1.1. functions from [0, oo) into M. d and we 
endow this space with the usual Skorokhod Ji-topology. We denote by 
C([0, oo),R d ) the space of continuous functions from [0, oo) into R rf . The 
Borel a-algebra on either D([0,oo),R d ) or C([0, oo),R d ) will be denoted by 
B d . Consider X, X 1 , X 2 , . . . , each of which is a d-dimensional process (pos- 
sibly defined on different probability spaces). The sequence {I"}™ =1 is said 
to be tight if the probability measures induced by the X n on the measur- 
able space (B([0, oo),W i ),B d ) form a tight sequence, that is, they form a 
weakly relatively compact sequence in the space of probability measures on 
(B([0,co),R d ),£ d ). The notation "X n => X" will mean that as n-> oo, the 
sequence of probability measures induced on (B([0, oo),M. d ),B d ) by {X n } 
converges weakly to the probability measure induced on the same space by 
X. We shall describe this in words by saying that X n converges weakly (or 
in distribution) to X as n — > oo. The sequence of processes {X n y^ =1 is called 
C-tight if it is tight and if each weak limit point, obtained as a weak limit 
along a subsequence, almost surely has sample paths in C([0, oo),R d ). 

2. Flow-level model. 

2.1. Network structure. We consider a network with finitely many re- 
sources labeled by j € J ^ 0. A route i is a nonempty subset of JT (interpreted 
as the set of resources used by route i). We are given a finite, nonempty set I 
of allowed routes. Let J = |J|, the total number of resources, and I = |I|, the 
total number of routes. Let A be the J x I incidence matrix which contains 
only zeros and ones, defined such that Aji = 1 if resource j is used by route 
i and Aji = otherwise. We assume that A has rank J so that it has full row 
rank. We further assume that resource (bandwidth) capacities (Cj : j £ J) 
are given and that these are all strictly positive and finite. 

2.2. Stochastic primitives. An active flow on route i corresponds to the 
continuous transmission of a document through the resources used by route 
i. Transmission is assumed to occur simultaneously through all resources 
on route i. It is assumed that a new document arrives to route i at each 
jump time of a Poisson process that has rate parameter Vi > and that 
each such document has an exponentially distributed size with mean 
where fii € (0,oo). These document sizes are assumed to be independent of 
one another and to be independent of all arrival times of documents. The 
number of documents on route i at time zero is assumed to be independent 
of the remaining sizes of those documents and these sizes are assumed to be 
independent and exponentially distributed with mean Initial numbers 
and sizes of documents, arrival times of new documents and their sizes for 
different routes i G I are assumed to be mutually independent. 
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2.3. Bandwidth sharing policy. Bandwidth capacity is allocated dynami- 
cally to the routes according to the following bandwidth sharing policy which 
was first introduced by Mo and Walrand [35]. The bandwidth for a route is 
shared equally among all of the documents currently being transmitted over 
that route. Given a fixed parameter a G (0, oo) and strictly positive weights 
(Kj : % G 1), if Ni(t) denotes the (random) number of flows on route i at time 
t for each i G I and N(t) = (Ni(t) :i G I), then the bandwidth allocated to 
route i at time t is given by Ai(N(t)) and this bandwidth is shared equally 
among all of the flows on route i, where the function A(-) = (Aj(-) : i G I) is 
defined as follows (we define it on all of as we shall later apply it to 
rescaled versions of N). 

Let A:M} + — > R\ be defined such that for each n G M+, Aj(n) = for 
i G Io( n ) = {Z G I:n; = 0}, and when I+(n) = {I G I:nj > 0} is nonempty, 
A + (n) = (Aj(rt) : i G I+(ti)) is the unique value of A + = (A, : i G I+(n)) that 
solves the optimization problem 

maximize G n (A + ) 

(1) subject to AjiAi < Cj, j G J, 

ien+(n) 

over Aj > 0, iGl+(n), 

where for n G Mi. \ {0} and A+ = (Aj : i G I+(n)) G Ej +(n)l , 



(2) G n (A + ) 



A 1 "" 

, t - ? , I — a 

iel+(n) 

^ KjnjlogAj, if a = l 

I i€l + (n) 



and the value of the right member above is taken to be — oo if a G [l,oo) 
and Aj = for some i G 1+ (n) . The resulting bandwidth allocation is called 
a weighted a- fair allocation. 

The properties of the function A are summarized in the following propo- 
sition. This proposition is proved in the Appendix of Kelly and Williams 
[26]. 

Proposition 2.1. For each n g R\, 

(i) Aj(n) > for each i G l + {n); 

(ii) A{rn) = A{n) for each r > 0; 

(iii) Aj(-) is continuous at n for those i such that m > 0; 

(iv) there exists at least one p G depending on n, such that 

(3) A i {n) = n i \— % —r-\ for all i Gl+(n), 
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where 

(4) Pj (c 3 - £ AjiAi (n))=0 for all j G J. 

The (pj :j S J) are Lagrange multipliers (or dual variables) for the op- 
timization problem, where there is one multiplier for each of the capacity 
constraints. 



2.4. Stochastic process description. The flow count process N = (iVj : i G 
I) is a Markov process with state space l} + ■ We use the following (equivalent 
in distribution) representation for N and the cumulative unused capacity 
process U = (Uj :j G J): 

(5) N i (t) = N i (0)+E i (t)-Si(T i (t)), iei, 

(6) Uj (t) =Cjt-^2 AjiTi (t) , j G J, 

where Ei is a Poisson process with rate z/j, 5^ is a Poisson process with rate 
/ij, Tj(i) is the cumulative amount of bandwidth allocated to route i up to 
time t and 

(7) Ti(t)= [ Ai(N(s))ds. 

Jo 

We assume that for each i G I, Ei and Si are represented by 

(8) Ei(t) = supL>0:f2&(l)<t\ 
and 

(9) 5i(t)=sup|n>0:^Ci(0<*| 5 

respectively, where is a sequence of i.i.d. exponential random vari- 

ables with mean l/z^ and {C«(0}£i * s a sequence of i.i.d. exponential random 
variables with mean It is assumed that {£i(0}£i> {Ci(0}zSi> A^(0), 

for i G I, are mutually independent. We define an (average) workload process 
by 

(10) W(t) = AM~ l N(t) for all t > 0, 

where M = diag(/i) is the I x I diagonal matrix with the entries of [i on its 
diagonal. 
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3. Sequence of systems and scaling. Consider an increasing sequence of 
positive scale parameters {ri}f2 =1 which converges to infinity. To ease the 
notation, we shall simply write r in place of 77, where it is understood that 
r increases to infinity through a sequence. We consider a sequence of flow- 
level models indexed by r, where the network structure with parameters 
A and C, and bandwidth sharing policy with parameters a and {Ki,i G 1} 
do not vary with r. Each member of the sequence is a stochastic system, 
as described in the previous section. We append a superscript of r to any 
process, sequence of random variables or parameter associated with the rth 
system that depends on r. Thus, we have processes N r ,W r ,U r ,T r , W , S r , 
sequences of random variables = {£[(7)}^ and Q = {Q(l)}fZi f° r £ G I, 
parameters v r and fi r , and matrices M T . Let p\ = v\j 1 n\ for each i G I. We 
shall henceforth assume that the following heavy traffic condition holds. 

Assumption 3.1 (Heavy traffic). There exist v,(j,eR\ and 9 G R J such 
that Vi > and fii > for all % G I, 

(11) v r — > v and fi r — > fi as r — > 00, 

(12) r(Ap r -C)^9 asr^oo. 

Let M = diag(^) and Pi = for all i el. We note that (11)-(12) imply 
that p r — > p as r — > 00 and ^4/? = C. 

Remark 3.1. Assumption 3.1 thus implies that all resources are heavily 
loaded. We do not consider the case where some resources are underloaded; 
however, we conjecture that the diffusion approximation in this case would 
be as if these underloaded resources were deleted from the model. 

We define fluid scaled processes N , W , U , T , E , S as follows. For each 
r and t > 0, let 

(13) W(t) = N r (rt)/r, W\t) = W r (rt)/r, 

(14) U r (t) = U r {rt)/r, ¥ \t) = T r (rt)/r, 

(15) E r (t) = E r (rt)/r, S r {t) = S r {rt)/r. 

We define diffusion scaled processes N r ,W r ,U r , E r , S r as follows. For each 
r and t > 0, let 

N r (r 2 t) 



(16) N r (t) 

(17) W r (t) 



r 
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(18) 



U r (t) 



U r {rH) 



r 



(19) 



E r (t) 



E r {rH)-v r rH 



r 



(20) 



S r (rH)-n r rH 



r 



As El, SI, i € I, are independent Poisson processes with parameters satis- 
fying the convergence conditions (11), it follows that we have the following 
well-known functional central limit result [2]: 



where E and S are independent I-dimensional Brownian motions starting 
from the origin with zero drift and covariance matrices diag(z/) and diag(/Li), 
respectively. 

Finally, we assume that, independent of (21), W r (Q) converges in distri- 
bution as r — > oo to a J-dimensional random variable. 

4. Fluid model. In this section, we recall some definitions and results 
established in the prior work [26]. These will be needed for our statement 
and proof of multiplicative state space collapse. 

4.1. Fluid model solution. A fluid model solution can be thought of as 
a formal limit of the sequence {N } as r — > oo. In fact, if one assumes that 
N (0) converges in distribution as r — > oo to a random variable taking values 
in then one can show (see the Appendix of [26]) that {(T^iT, S r , ~N r , Tf)} 
is C-tight and any weak limit point (T, E, S, N, U) yields a fluid model so- 
lution N a.s. 

The following notions are used in the definition of a fluid model solution 
given below. A function f = (fi, ■ ■ ■ , fx) '■ [0, oo) — > is said to be abso- 
lutely continuous if each of its components fi : [0, oo) — > M + , i = 1, . . . , I, is 
absolutely continuous. A regular point for an absolutely continuous function 
/ : [0, oo) — > is a value of t G (0, oo) at which each component of / is dif- 
ferentiable. [Since / is absolutely continuous, almost every time t G (0, oo) is 
a regular point for /. Furthermore, / can be recovered by integration from 
its a.e. defined derivative.] 

Definition 4.1. A fluid model solution is an absolutely continuous 
function n: [0, oo) — » such that at each regular point t > for n(-), we 
have, for each i El, 



(21) 



(E r ,S r )=>(E,S) 



as r 



•DC: 



(22) 




if ni(t) > 0, 
if m(t) = o 
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and for each j G J, 

(23) A ji k i (n{t))+ A HPi< C 3> 

i&+(n(t)) i&o(n(t)) 

where l+(n(t)) ={iel:ni(t)> 0} and I (n(t)) = {i El:m(t) = 0}. 

Remark 4.1. Note that we are not assuming uniqueness of fluid model 
solutions given the initial state. 

4.2. Invariant manifold. 

Definition 4.2. A state no G K+ is called invariant (for the fluid model) 
if there is a fluid model solution n(-) such that n(i) = no for all t > 0. Let A^q 
denote the set of all invariant states. We call A4 a the invariant manifold. 

Remark 4.2. Although a G (0,oo) is fixed throughout, we indicate the 
dependence of A4 a on a explicitly here as it will be useful later on when we 
explain how the state space for the proposed workload diffusion approxima- 
tion varies with a. 

Various characterizations of the invariant states were given in [26]. We 
summarize these in Theorem 4.1 below. For this, we need the following 
definitions. 

For each n G R+, define w(n) = (wj(n) : j G J), to be given by 

(24) Wj (n) = Y J A ji r ^, jGX 

We call w(n) the workload associated with n. 

For each w G K+, define A(w) to be the unique value of n G that solves 
the following optimization problem: 

minimize F(n) 

(25) subject to Ajj — > Wj, j G J, 



over rii > 0, i G I, 



where 

(26) F(n) = -L- ^iKi/W-) * 1 . nGR^. 

(This function i 7 was introduced in [3] as a Lyapunov function for the fluid 
model. In fact, it is a Lyapunov function for the original flow count process 
N and can be used to show positive recurrence of N when the average 
load on each resource is less than its capacity.) The function A has the two 
properties stated in the next proposition. 
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Proposition 4.1. The function A:R+^R+ is continuous. Further- 
more, for each w G Ri and c> 0, 

(27) A(cw)=cA(w). 

Proof. The first property is proved in Lemma 6.3 of [26] . For the second 
property, note that for w G R+ and c > 0, cA(w) satisfies the constraints in 
(25) with cw in place of w and so 

(28) c a+1 F(A(w)) = F(cA(w)) > F(A(cw)). 

On the other hand, by writing w/c in place of w in (28), we find that 

c a+l F(A(w/c))>F(A(w)) 
and then, by replacing c by 1/c and rearranging, we obtain 

(29) F{A(cw)) > c a+1 F(A(w)). 
On combining (28) and (29), we conclude that 

F(A(cw)) = F(cA(w)) 

and by uniqueness of the solution to (25), we obtain the second property. 
□ 

Theorem 4.1. The following are equivalent for n G R]j_: 

(i) n is an invariant state (for the fluid model), that is, n G M a > 

(ii) Aj(n) = pi for all i G I+(n) = {/ G I:n; > 0}; 

(iii) there exists some q G R^ such that 

(30) Hi = Pi f '' jAj ') for all i G I; 

(iv) n = A(w(n)). 

Proof. This follows immediately from Lemma 5.1 and Theorems 5.1, 
5.3 of [26]. □ 

Remark 4.3. Note that if the conditions of Theorem 4.1 are satisfied, 
then p = q satisfies conditions (3) and (4), and thus (qj'j G J) are dual 
variables for the optimization problem (1); note that we use the fact that 
Ap = C for this. We have chosen to use q to denote the dual variables asso- 
ciated with the invariant states, to distinguish them from the dual variables 
p associated with arbitrary states n. This distinction will be useful in our 
proof of convergence to a diffusion process (see Lemma 7.5), where we need 
to distinguish the dual variables associated with actual system states from 
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the dual variables associated with nearby points on the invariant manifold. 
It is important to make this distinction because when a system state is near 
an invariant state and some component of the system state is near zero, it 
need not follow that the dual variables associated with the two states are 
close. 

Proposition 4.2. For each w G M+, A(w) £ M a , that is, A(w) is an 
invariant state. 

Proof. Let w € M+. Since A(w) is the unique optimal solution to (25), 
it follows from Lemma 6.4 of Kelly and Williams [26] that there exists q £ 
such that A(w)i = Pi(J2j^jQj^ji/ K i) 1 ^ a f° r au * £ I- Then, by Theorem 4.1, 
we have that A(w) is an invariant state. □ 

4.3. Asymptotic properties of fluid model solutions. The next three propo- 
sitions note some properties of fluid model solutions that follow from the 
analysis in [26] and that are used in our proof of multiplicative state space 
collapse (see Theorem 5.1 below). 

Proposition 4.3. For each Re (0,oo), there is a constant D(R) G 
[R, oo) such that for any fluid model solution n(-) satisfying \n(0)\ < R, we 
have \n{t) \ < D(R) for all t > 0. 

Proof. The proof of this proposition is implicit in the proof of Theorem 
5.2 in [26]. □ 

The next proposition states that fluid model solutions converge uniformly 
to the invariant manifold A4 a , where the uniformity applies across all fluid 
model solutions that start inside a compact subset of Mil. 

Proposition 4.4. Fix R e (0, oo) and e > 0. There is a constant Tr i£ 6 
[l,co) such that for each fluid model solution n(-) satisfying \n(0) \ < R, we 
have 

(31) d(n(t),M a ) < e for all t > T R , £ . 

Proof. The content of this proposition is the same as that of Theorem 
5.2 in [26]. □ 

Proposition 4.5. For each R e (0, oo) and e > 0, there exists 5 > 
such that for any fluid model solution n(-) satisfying |n(0)| < R and d(n(0),M a ) < 
5, we have d(n(t),A4 a ) < e for all t>0. 
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Proof. This proposition follows from the proof of Theorem 5.2 in [26]. 
For completeness, we provide a few details. Fix R > and e > 0. By Propo- 
sition 4.3, there exists a compact set B(R) in such that n(t) G B(R) for 
all t > for any fluid model solution n satisfying |n(0)| < R. Let D = {u G 
B(R) :d(u,A4 a ) > e}. As shown in [26], there is a continuous function H : 
— > that is zero on A4 a and strictly positive off Ai a such that H(n(-)) 
is nonincreasing for each fluid model solution n. Let 5i = inf {H{u) : u G £)}. 
Then, #i > and, by the properties of H , there exists 5 > such that when- 
ever n is a fluid model solution satisfying |n(0)| < R and d(n(0),M a ) < <5, 
we have H(n(0)) < 6±. Since H(n(-)) is a nonincreasing function, it then 
follows that H(n(t)) < 6\ for all t > 0. The latter implies that n(t) ^ D for 
all t > and so d\n(t),M a ) < e for all t > 0. □ 

The following corollary shows that fluid model solutions starting on the 
invariant manifold stay at their starting points for all time. We note this for 
the reader's interest. We do not use this corollary in our proofs. 

Corollary 4.1. Suppose that n(-) is a fluid model solution such that 
n(0) G M a . Then, n(t) = n(0) G M a for all t > 0. 

Proof. By Proposition 4.5, since d(n(0),M a ) = 0, we have d(n(t),M a ) = 
for all t > and hence n(t) G M a for all t > 0. It follows from Theorem 
4.1 that for each f > 0, A,;(n(i)) = p,; = Vi/ni for all i such that rii(t) > 0. 
Then, by the fluid model dynamics (22), for each regular point t > of n(-), 
we have 

(32) 4: n i( t ) = u i~ ViPi = 

dt 

if rij(i) > and the last equality also holds if nj(t) = 0. It follows, since 
the absolutely continuous function n(-) can be recovered from its almost 
everywhere defined derivative, that n(t) = n(0) for all t > 0. □ 

5. Main results. In this section, we describe the main results of this 
paper. We begin with our result on multiplicative state space collapse. This is 
established using the asymptotic behavior of fluid model solutions described 
in Section 4.3. Loosely speaking, multiplicative state space collapse shows 
that an approximation for iV r can be derived from one for W r via the 
continuous lifting map A [see (25)-(26) for the definition of this map]. This 
lifting map can be nonlinear (for a/1). The multiplicative state space 
collapse result leads to a natural conjecture for a diffusion approximation to 
W r . In the case a = 1, assuming a mild local traffic condition and suitable 
initial conditions, we prove that the conjectured diffusion approximation is 
valid. (In Section 5.6, we indicate some of the challenges associated with 
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establishing this conjecture for When a = 1 and all of the weights 

for the bandwidth sharing policy are equal (proportional fair sharing), we 
use results of Harrison and Williams [17] and Williams [41] to show that 
the diffusion has a product form invariant measure. When this measure has 
finite total mass, this result suggests an approximation for the stationary 
distribution of the flow count process which we are able to extend to the 
case where the document size distributions are finite mixtures of exponential 
distributions and to some models with multi-path routing. So as not to 
disrupt the flow of results and associated discussion, we defer the rather 
lengthy proofs of multiplicative state space collapse and of the diffusion 
approximation to Sections 6 and 7, respectively. 

5.1. Multiplicative state space collapse. 

Definition 5.1 (Multiplicative state space collapse). Multiplicative state 
space collapse holds (for the sequence of flow-level models described in Sec- 
tion 3), if, for each T > 0, 



in probability as r — > oo. 

Remark 5.1. We note here that in our form of multiplicative state space 
collapse, the normalization (in the denominator) is in terms of the flow count 
process, whereas in Bramson's version for multiclass queueing networks [5], 
it is in terms of a workload process. Furthermore, the lifting maps in [5] are 
all linear, whereas here, A can be nonlinear (for a / 1). 

Remark 5.2. If (33) holds without the factor in the denominator, then 
state space collapse is said to hold. Multiplicative state space collapse is more 
convenient for the purpose of verification and if {N r } (or {VT r }) satisfies 
a compact containment condition, then state space collapse follows from 
mutiplicative state space collapse. As was the case for open multiclass HL 
queueing networks [42], in establishing our diffusion approximation result 
for a = 1 under a mild local traffic condition, we will show for this case that 
multiplicative state space collapse implies state space collapse. 

The following theorem is one of the main results of this paper. It is proved 
in Section 6. 

Theorem 5.1. Assume that 



in probability as r — > oo. Multiplicative state space collapse then holds. 



(33) 



N r (.)-A(W r (-))\\ T 

||iH)llTVl 







(34) 



N r (0)- A(W r (0))|->0 
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5.2. Conjectured diffusion approximation. We are interested in obtaining 
a diffusion approximation for the scaled workload process W r . The multi- 
plicative state space collapse result can then be used to obtain a diffusion 
approximation for the scaled flow count process N r . 

For each r, define the double fluid scaled bandwidth allocation process 

T r (r 2 t) 

(35) T r (t) = ^^, t>0. 

Using (5)— (7) and (10) for the rth system, the definitions of rescaled pro- 
cesses and (ii) of Proposition 2.1, after some simple manipulations, we ob- 
tain, for all t > 0, 

(36) W r (t) = W r (0) + X r (t ) + U r (t ) , 
where 

(37) X r (t) = A(M r )~ 1 (E r (t) - S r (T r (t))) + r(Ap r - C)t, 

(38) (^r(t)))i = S[(f[(0), iel, 

U r (t) = r(Ct-AT r (t)) 

(39) =r~ l [ \c -Ak(N r (s)))ds 

Jo 

= r [\c-AA(N r {s)))ds. 
Jo 

If we postulate that multiplicative state space collapse implies state space 
collapse, then, by formally (nonrigorously) passing to the limit in the ex- 
pression (36) for W r , we can obtain a natural conjecture for a diffusion 
approximation to W r . Immediately below, we give an informal description 
of how one might arrive at this conjecture. Following that, we give a precise 
mathematical description of the diffusion process and of the conjecture. 

For the following informal description, which is used to motivate the form 
of the conjectured diffusion approximation, we postulate that the sequence 
of processes {(W r ,T r , U r , E r , 5 ,r )} converges in distribution to a 5-tuple of 
continuous processes (W,T*,U,E,S). We also postulate that state space 
collapse (SSC) holds (not just multiplicative state space collapse). From the 
convergence of {VF r }, SSC and continuity of the lifting map A, it follows 
that N r converges in distribution to a continuous process N = A(W) that 
lives on the invariant manifold A4 a . The fact that W r = A(M r )~ 1 N r will 
yield in the limit that W = AM N. By the characterization of invariant 
states given in Theorem 4.1, it will then follow that for each t > and 
realization u, there exists q(t,u) € such that 

(40) ff iM = p i ( p^^^ y /a far allien. 
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Consequently, W = AM N will live in the space 

(41) W a = AM' 1 M a , 
where 

(42) M a = jn 6 R+ : m = pi O^^ j ' for a11 * G ^ some Q € M + 

We call W a the workload cone. This is the state space for the conjectured 
diffusion approximation W. 

By the assumption made at the end of Section 3, we know that W r (0) 
converges in distribution, independently of the primitive arrival and ser- 
vice processes. We denote the limit distribution of W r (0) by rj. Under the 
postulated convergence of W r , rj will be concentrated on W a . 

We now turn our attention to the term X r in the expression (36) for 
W r . Given the functional central limit theorem result (21) for the diffusion 
scaled arrival and service processes (E r ,S r ), and the heavy traffic Assump- 
tion 3.1, if we postulate that the double fluid scaled allocation processes 
T T achieve the nominal levels given by T*(t) = pt in the heavy traffic limit, 
then X r given by (37) will converge in distribution to the Brownian mo- 
tion AM-\E{-) - 5(T*(-))) + 0(-), where (5(T*(0))i = Si(T*(-)) for i £ I, 
6{t) = Ot for all t > and 9 is defined in the heavy traffic Assumption 3.1. 
This Brownian motion starts from the origin, and has drift 6 and covari- 
ance matrix ^4M~ 1 diag(z^ + v)M~ l A\ where we have used the facts that 
M = diag(/i) and pipi = v\ for all i G I to compute the second term in the 
diagonal part of the covariance matrix expression. 

On examining the representation (36) for W r , we see that it remains 
to conjecture properties for the postulated limit U of the scaled unused 
capacity process U r as r — ► oo. The limit U will inherit the nondecreasing 
property from the U r . The main issue is to determine where each of the 
components of U can increase. For the prelimit process, U r , to determine 
where its components increase, we see from (39) that it suffices to identify 
where each of the components of C — AA(N r (■)) is strictly positive. From 
Proposition 2.1, if j E J such that Cj — Y^iei AjiAi(N r (t, u)) > 0, then there 
is a Lagrange multiplier p r (t,u>) 6 such that p T j(t,uj) = and 

(43) Nl (t , U ) = A, (N r (t, U ) ) ( EfcgJ ^ "^ Y" far alii el 

[For this, we note that both sides of (43) are zero if N[(t, u) = 0.] If A(N r (t, u)) 
is close to the nominal allocation p, then, by (43), N r (t,uj) is near 

M{ = jn € R\ : m = Pi (^^) ' for all i G I, 
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(44) 

some q £ satisfying qj = > , 

the subset of M a obtained by setting qj equal to zero in (42), and then 
W r = A(M r )~ l N r will be close to 

(45) Wi = {AM- 1 n:neM{}, 

which we refer to as the jth face of the workload cone W a . Thus, one might 
conjecture that in the limit, Uj can increase only when W is on the face 

Combining all of the above considerations leads to the informal conjecture 
that W r converges in distribution to a J-dimensional diffusion process W 
of the form Pv'(O) + X + U. The state space for W is the workload cone W a 
and the initial distribution of W is given by r/. In the interior of W a , the 
increments of W are given by the increments of a Brownian motion X with 
drift 6 and covariance matrix 

(46) T = 2AM~ 1 diag(u)M~ 1 A', 

that starts from the origin. The process W is confined to the cone W a 
by instantaneous "pushing" at the boundary of W a . The direction of push 
allowed when W is on the boundary face W 3 a is 7 J , the unit vector parallel to 
the positive jth coordinate axis in R+, and the cumulative amount of push in 
that direction is given by the jth component of the continuous nondecreasing 
process U. (At the intersections of boundary faces, the combined effect of 
pushing on the individual boundary faces is a push in the direction of a 
convex combination of the pushing directions available from the intersecting 
boundary faces.) The direction of push on a given boundary face is usually 
called a direction of reflection, whereas it might be better thought of as a 
direction of regulation for the process. (The term "reflection" comes from the 
fact that in one dimension, when the drift is zero, the construction of such 
a regulated process from a Brownian motion can be achieved by a mirror 
reflection; although this type of construction does not generally apply in 
higher dimensions, the term "reflection" is still used.) 

We now introduce a precise definition for the conjectured diffusion ap- 
proximation to the workload process W r . (The delicate issue of existence 
and uniqueness for this process is discussed below.) This definition will be 
used in giving a precise statement of our conjecture. Here, 9 is a vector in 
]R J , r is given by (46), 7 J is the unit vector parallel to the positive jth 
coordinate axis and r\ is a Borel probability measure on W a . 

Definition 5.2. A semimartingale reflecting Brownian motion that 
lives in the cone W Q , has direction of reflection 7 J on the boundary face 
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W£ for each j £ JJ, has drift 6 and covariance matrix T, and has initial dis- 
tribution 7] on W Q is an adapted, J-dimensional process W defined on some 
filtered probability space (CI, J 7 , {^i},P) such that 

(i) P-a.s., W(t) = W(0) +X(t) + U(t\ for all t > 0, 

(ii) P-a.s., W has continuous paths, W(t) G W Q for all £ > and W(0) 
has distribution rj, 

(iii) under P, 

(a) X is a J-dimensional Brownian motion starting from the origin with 
drift 6 and covariance matrix V; 

(b) {X(t) — 9t, Tt,t > 0} is a martingale, 

(iv) for each j € J, Uj is an adapted, one-dimensional process such that 
P-a.s., 

(a) §(0) = 0; 

(b) Uj is continuous and nondecreasing; 

(c) Uj(t) = /* l {W(s)eWi } dUj(s) for all t > 0. 

Remark 5.3. We call a process W satisfying the above properties an 
SRBM associated with the data (W a ,9,Y,{^ : j 6 J}, rj). Here, "adapted" 
means adapted to the filtration {J-t}- We note that this filtration need not 
be the one generated by X — it can be larger. However, it can always be 
taken to be the filtration generated by W,X,U. The martingale condition 
on X is included here as we are using a "weak" definition of the process. 
This martingale property is needed in establishing uniqueness in law for an 
SRBM. The term "semimartingale" refers to the fact that W is the sum of 
a continuous martingale and a continuous process that is locally of bounded 
variation. Condition (iv)(c) corresponds to the condition that Uj can only 
increase when W is on the boundary face W^. 

We now give a precise statement of our conjecture. 

Conjecture 5.1. Suppose that the limit distribution ofW r (0) is r), a 
probability distribution on W a endowed with the Borel a-algebra, and sup- 
pose that 

\N r (0) -A(W r (0))\ ^0 in probability as r -> oo. 

Then, W r converges in distribution as r — > oo to a process W that is an 
SRBM associated with the data (W a , &, {l 3 '-j G J},??)- 

We shall prove that Conjecture 5.1 holds when a = 1, provided that a mild 
local traffic condition holds. We state this result in the next subsection. In 
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the remainder of the current subsection, we indicate some of the challenges 
associated with constructing a rigorous proof of the conjecture. 

When a = 1 (corresponding to weighted proportional fair sharing), we 
can express the cone W a in the simple form 

(47) Wi = {ABA'q:q£R 3 + }, 

where B is an I x I diagonal matrix with the ith diagonal entry being 
-*r— > 0. Thus, Wi is a polyhedral cone. Since A has full row rank and 

B is a diagonal matrix with strictly positive diagonal entries, ABA' is a 
linear bijection between Ri and W±. It follows from this that Wi is a simple 
polyhedral cone. Necessary and sufficient conditions for the existence and 
uniqueness in law of SRBMs living in simple polyhedral domains have been 
given by Dai and Williams [10]. Under these conditions, the SRBM is a 
diffusion, that is, a continuous strong Markov process. It will turn out that 
the conditions of [10] are satisfied by our data when a = 1. For a^l and 
J = 2, the workload cone is a wedge which is still a simple polyhedral cone. 
However, in general, for and J > 2, the workload cone W a is not a 

polyhedral cone (it has curved boundaries). In this case, we have some partial 
(unpublished) results on existence and uniqueness for SRBMs. The main 
impediment to obtaining a general result is that boundary faces can meet 
in cusp-like singularities, making it challenging to even determine whether 
the process can escape from the cusp and whether it can do so in a unique 
manner (see Section 5.6 for an example). 

Even if one has existence and uniqueness of the SRBM, for any proof of 
the conjecture, there are a number of other challenges to overcome. First, one 
needs to establish C-tightness of the sequence of triples {(IF r , X r , U r )}. This 
is largely an issue of the C-tightness of {U r }. One also needs to show that 
multiplicative state space collapse implies state space collapse. One of the 
most challenging aspects is to show that for any possible limit (W,X,U) of 
the sequence {(W r , X r , U r )}, for each j € J, the process Uj can only increase 
when W is on the boundary face W 3 a . Indeed, in our informal use of (43) to 
arrive at our conjecture, we neglected the fact that Ai(N r (t,uj)) need not 
be near pi when some component of N r (t,ui) is near zero [recall that Aj(n) 
need not be continuous when 7ij is zero]. Also, the notions of "nearness" 
and "closeness" used loosely in our informal description are not necessarily 
uniform. However, if N[(t,u>) is at or near zero when pj(t,u) = for an i 

such that Aji > 0, then we can show that W r (t,uj) is near the boundary face 
yV 3 a - To take advantage of this observation, in the case when we prove the 
conjecture (a = 1), we will assume that a mild local traffic condition holds. 

In summary, the main reasons that we are able to treat the case a = 1 
are that the existence and uniqueness theory for the limit diffusion process 
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is in place [10] and there is an associated invariance principle [20] which, 
loosely speaking, is a perturbation result telling us that processes such as 
W r , that satisfy perturbed versions of the defining conditions for an SRBM, 
are close in distribution to an SRBM. In particular, for a = 1, the invariance 
principle of [20] takes care of establishing the C-tightness of {(W r ,X r , U r )} 
and, in the presence of the uniqueness in law of the SRBM [10], it implies 
convergence in distribution of W r to an SRBM. In the case a/1 and 3 = 2, 
W a is a wedge (a polyhedral cone) and our proof for a = 1 can be extended 
to this case. In as yet unpublished work, we have been able to establish 
uniqueness in law of the SRBM and to establish an invariance principle for 
some cases where and J > 2. However, some cases, especially when 

boundary faces meet in cusp-like singularities, are as yet unresolved. We 
summarize the situation for a ^ 1 in Section 5.6. However, because of the 
partial nature of our results so far, we leave the description of these further 
developments to future work. 

5.3. Diffusion approximation for weighted proportional fair sharing (a = 
1 ). The following condition is used in the next theorem. This condition can 
be interpreted as a local traffic assumption, under which each resource has 
at least one route that only uses that resource. 

Assumption 5.1 (Local traffic). For each j 6 J, there exists at least one 
i £ I such that Aji > and = for all j. 

The following theorem is proved in Section 7. 

Theorem 5.2. Assume that a = 1 and that the local traffic Assumption 
5.1 holds. Suppose that the limit distribution of W r (0) as r — » oo is i] (a 
probability measure on Wi) and that \N r (0) — A(W r (0))\ — > in probability 
as r — > oo. Then, (W r ,N r ) converges in distribution as r — > oo to a continu- 
ous process (W, N), where W is an SRBM with data (Wi, 0, T, : j 6 J}, rj) 
and N = A(W). 

In the case a = 1 , the lifting map A is in fact a linear map on Wi , given 

by 

(48) A(w) = diag(p) diag^)" 1 A '(AB A') -1 w , w 6 Wi. 

Indeed, for a = 1 and w £ Wi, if q = (ABA')~ 1 w and n is given by the 
right-hand side of (48), then w(n) = AM~ l n = w, by the definition of B = 
M~ x diag(i/) diag(K) _1 M _1 , and n = diag(/3) Ai&g^)' 1 A' q so that (30) holds. 
Then, by Theorem 4.1, we conclude that 

n = A(w(n)) = A(w) 
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Fig. 1. A linear network with two resources and three routes. 



and, hence, (48) holds. By the remark following Theorem 4.1, the (qj : j € J) 
defined above are dual variables for the optimization problem (1). It follows 
that we can associate a process of dual random variables Q with the SRBM 
W of Theorem 5.2, as follows. Given W as in Theorem 5.2, define 

(49) Q = (ABA')~ 1 W. 

This process Q inherits an SRBM structure from W. In fact, Q is a semi- 
martingale reflecting Brownian motion living in Ri, having the form 



(50) Q(t) = Q(0) + {ABA')~ l X{t) + (ABA!)~ 1 U(t), 
where the Brownian motion {ABA')~ l X has drift {ABA') 



t>0, 
~ l 6 and covariance 



matrix [ABA') 1 T(ABA') , and Uj can increase only when Qj is zero, 
j € J. The direction of reflection on the boundary face of R^ is defined 
by the jth column of the matrix (ABA') -1 . The initial distribution of Q is 
obtained by applying the linear transformation (ABA') -1 to the distribution 
r\. (For the formal definition of such an SRBM, where reflection directions 
are not in general parallel to coordinate directions, see [43].) 

As an illustration of Theorem 5.2, consider a two-resource linear network 
operating under a weighted proportional fair sharing policy (a = 1). This 
network is depicted in Figure 1. It has two resources and three routes. Each 
resource has a route that passes only through that resource and there is also 
a route that passes through both resources. 

The workload cone in this case is a wedge in R^_ that has the following 
representation: 
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It can be easily computed that the two boundary faces of the wedge Wi 
have the following expressions: 

W\ = {w e R 2 + : wi > 0, w 2 = Piwx} 

and 

Wf = {w eR 2 + :w 2 >0,wi= f3 2 w 2 }. 

The wedge is depicted in Figure 2. For a linear network, the local traffic 
condition holds automatically. Thus, the conclusion of Theorem 5.2 applies, 
provided VF r (0) converges in distribution to a random variable with distri- 
bution r\ concentrated on Wi and |iV r (0) — A(H^ r (0))| — > in probability as 
r — > 00. For example, this holds if each of the systems indexed by r starts 
empty and n is the point mass at the origin in K/L The limiting SRBM W 
lives in the wedge Wi and is confined there by reflection (or pushing) at the 
boundary. Reflection occurs in the horizontal direction (corresponding to 
resource 1 underutilizing capacity) on the bounding face where W2 = ftiw\. 
The interpretation of this is that although there is no work for resource 1 on 
route 1, there is work for this resource on route 3. However, congestion at 
resource 2, through the nature of the bandwidth sharing policy, is prevent- 
ing resource 1 from working at its full capacity. Similarly, vertical reflection 
(corresponding to resource 2 underutilizing capacity) on the bounding face 
w\ = (32W2 is interpreted to mean that congestion at resource 1 is preventing 
resource 2 from working at its full capacity. Thus, the shape of the workload 
space indicates the entrainment of resources, whereby congestion at some 
resources may prevent other resources from working at their full capacity. 
We note that when K3 — > 00, the upper boundary of Wi tends to the ver- 
tical axis and the lower boundary tends to the horizontal axis, hence the 
wedge expands to the whole quadrant, approaching the situation with full 
utilization of the resources. 

5.4. Product form stationary distribution for proportional fair sharing. 
In this subsection, we prove a result which shows that when a = 1 and = 1 
for all i £ I (proportional fair sharing), an SRBM W with the properties 
described in Theorem 5.2 has a product form invariant measure. (This is a 
result for SRBMs and so does not require the local traffic condition a priori.) 

Theorem 5.3. Suppose that a = 1 and Ki = 1 for all i El. Let ir be 
the measure on Wi that is absolutely continuous with respect to Lebesgue 
measure with density given by 

(52) p(w) = exp(t> • w), w £ Wi, 

where 



(53) 



v = 2T- 1 6. 
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Fig. 2. The workload cone W 1 for a linear network with two resources and three routes 
is an infinite wedge in the positive quadrant of R . A finite portion of the wedge is shown 
above (between the two shaded regions). Under the lifting map A, points (wi,W2) on the 
boundary W2 — fliwi of the wedge are mapped to points (711,712, 713), where 71 1 = (and the 
corresponding q £ R+ has q\ — 0); similarly, points (wi,W2) on the boundary w± = P2W2 
are mapped to points (711,712,71-3), where 712 = (and the corresponding q € R+ has 52 = 0). 

The product form measure ir is an invariant measure for the SRBM having 
state space Wi, directions of reflection {"f J :j € J}, drift 6 and covariance 
matrix T . This measure is integrable over Wi if and only if Oj < for all j £ J 
and then, after normalization, it defines the unique stationary distribution 
for the SRBM. 

Proof. Sufficient conditions for a refiecting Brownian motion in a sim- 
ple polyhedral domain to have a product form invariant measure were de- 
termined by Williams in [41], building on solutions of a related analytic 
problem obtained by Harrison and Williams in [17]. In these works, the co- 
variance matrix for the process is the identity matrix. In order to apply 
these results, we need to perform a linear transformation to transform an 
SRBM with covariance matrix T into one whose covariance matrix is the 
identity matrix. We perform this transformation below; the computations 
are straightforward, though a little tedious, as we need to normalize the 
resulting directions of reflection to have inward normal components of unit 
length, to facilitate use of the results in [41]. Similar manipulations were car- 
ried out in the proof of Theorem 23 in [18], where the reflection directions 
had a special form. 

Before introducing the transformation, we obtain an alternative represen- 
tation for the simple convex polyhedron W\. By (47), 

Wi = {ABA'q : q E R J + } = {w £ 1 J : (ABA'^w G R 3 + }. 
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Here, since Ki = 1 for all i, we have that B = M 1 diag(i/)M 1 and, by (46), 
ABA' = ±I\ Thus, 

(54) Wi^lweR^r^elj}, 

where we have used the fact that q G Mi if and only if 2q G . 

We now define the linear transformation to be applied to the SRBM. Let 
T denote the diagonal matrix that has the same diagonal entries as T . 
Let L be the rotation matrix whose rows are the orthonormal eigenvectors 
of the covariance matrix T and D be the corresponding diagonal matrix 
of eigenvalues such that T = L'DL, where V = L . Let W be an SRBM 
with state space Wi, drift 9, covariance matrix T and directions of reflection 
given by {7 J : j G J}, with a decomposition as in (i) of Definition 5.2. Let 
V = D~ l / 2 L and define Z = V W. Then, Z is an SRBM in the simple convex 
polyhedron 

Z x = {Vw : w G Wi} = {z € M J : V~ l z € Wi} 
= {zGl J :rV" 1 zGl|} 
= {zGR J :ii J 'Z>fl for all j G JJ}, 
where, for each j G J, n- 3 is given by the jth row of the matrix 

(55) 9 = T -l/2 r -l F -l = T -X/2 T -l L > D X/2 

and n J is the unit inward normal to the jth face of Z\. (The matrix Y -1 / 2 
is used to normalize the n 3 to be of unit length.) The process Z inherits the 
following decomposition from W: 

(56) Z(t) = Z(0) + VX(t) + RY(t), t>0, a.s., 

where R = VH^ 1 = D -1 / 2 LH -1 , = T 1//2 is the diagonal matrix with the 
same diagonal entries as QV and Y = HU is a continuous, nondecreasing 
process that starts from zero and can increase only when Z is on the jth 
face of Z\ . The columns of the matrix R are the directions of reflection for Z 
on the faces of Z\ , normalized so that the inward normal component of each 
direction of reflection has unit length. (The matrix H" 1 is used to achieve 
the normalization.) The matrix R has the form 

(57) R = 0' + H', 

where is the matrix specified in (55) (whose rows are the inward unit 
normals to the faces of Z\) and S is the matrix whose rows consist of the 
components of the (normalized) directions of reflection that are tangent to 
each of the faces of Z\. In particular, the diagonal entries of GH' are all 
zero. In this context, the sufficient condition given in [41] for existence of 
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a product form invariant measure for Z is the so-called "skew symmetry 
condition," 

(58) e~' + ~e' = o. 

The vector form of (58) is 

(59) n* • P + t ■ n j = for all i, j G J, 

where, for each iG J, n* is the inward unit normal to the iih face of Z\ and 
s l is the tangential component of the (normalized) direction of reflection 
on that face. A geometric interpretation of the skew symmetry condition 
is provided in [17]. We refer the reader to that paper, especially Section 
9, for a full discussion. Briefly, for a simple convex polyhedron as we have 
here, the skew symmetry condition can be shown to be equivalent to a 
local condition. This local condition can be stated in words as requiring 
that near the intersection of any two distinct faces of Z\ , the component of 
the reflection vector on one face that is directed toward the intersection of 
the faces is balanced on the other face by the component of the direction 
of reflection for that second face which is of the same magnitude as the 
component on the first face, but it is directed away from the intersection 
of the faces. Indeed, under the skew symmetry condition, with probability 
one, the SRBM Z will not hit the intersection of any two faces when started 
away from that set, as is proved in [41]. 

The left-hand side of condition (58) is equal to 

9{R - 9') + (R' - 9)9' = -299' + 9R + R'9' 

= _ 2T - 1 /2 r -l L / £ ,l/2 £) l/2 Lr -l T -l/2 

+ T- 1 ' 2 T- 1 L'D 1 ' 2 D- 1 I 2 LH- 1 
+ H- 1 L'D- 1 / 2 D 1 / 2 LF- 1 T- 1 / 2 
= _2Y~ 1 / 2 r~ 1 Y~ 1 / 2 

= o, 

where we have used the facts that V = L'DL, L'L = I and H = Y 1 / 2 . 

Thus, the skew symmetry condition (58) holds and it follows from The- 
orem 1.2 of [41] that the SRBM Z has a product form invariant measure 
with a density relative to Lebesgue measure that is proportional to exp(/3 • z) , 
z G Z\, where 13 = 2(1- 9~ l '~)- 1 V6 = 2V6. Note, for this, that 

Q-1 S = D -l/2 LTT X/2^ H -l L f D -l/2 _ T -l/2 r -l L / jD l/2 ) 

= £,-1/2^x1/2^-1/2^-1/2 _ T -l/2 L ' D -l LL ' D l/2) 
= D -V2 LT rV2(T-l/2 L > D -l/2 _ T -l/2 L /£-l/ 2) 
= 0, 
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where we have used the facts that T _1 = L'D" 1 L and L'L = I. Furthermore, 
by Corollary 1.1 of [41], if the exponential density is integrable over Z\, then, 
after normalization, it yields the unique stationary distribution for Z. 

By inverting the linear transformation V, we can transform this result 
back to one for W . Noting that for z = Vw, 

(60) /9 • z = P ■ Vw = 26'V'Vw = 29'T^w, 

we conclude that (52) is an invariant density for the original SRBM W and 
if this is integrable over Wi, then, after normalization, it yields the unique 
stationary distribution for W. Using the representation (54) for Wi, we see 
that this density will be integrable over Wi if and only if exp(# ■ q), q € 
is integrable over , which occurs if and only if 9j < for each j E X □ 

Remark 5.4. The product form invariant measure of Theorem 5.3 is 
remarkable, yet the proof of the result gives little insight into why the re- 
flection directions and covariance matrix in the particular case a = 1, Ki = 1 
for i = 1, . . . ,1 should allow the result to hold. We simply note here that 
the authors first suspected that a product form result might be found after 
observing that it is possible to describe a network of queues with a product 
form stationary distribution whose conjectured Brownian model approxima- 
tion has the same directions of reflection and covariance matrix as the Brow- 
nian model approximation under study in this paper, provided that a = 1 
and Ki = 1 for i = 1, . . . , I, that is, the case of proportional fair sharing. Ear- 
lier connections between product form queueing networks and proportional 
fairness have been explored by [4, 34] and the relationship between these 
several connections seems a rich area for further study. 

The product form of the density (52) does not imply that, when 9j < 
for all j 6 J, the components of the SRBM W are independent under the 
stationary distribution for the SRBM since, in general, the cone Wi is not 
an orthant. Independence can, however, be deduced for the components of 
the SRBM Q of dual variables. 

Corollary 5.1. Suppose that the assumptions of Theorem 5.2 hold, 
that Ki = 1 for all i G I and that 6j < for all j € J. Let (W, N) be the 
process identified in Theorem 5.2. The SRBM Q = 2F~ 1 W of dual variables 
then has a unique stationary distribution and this distribution has a density 
relative to Lebesgue measure that is proportional to exp(# • q), q £ ffi^ . Under 
this stationary distribution, the components of Q are independent and Qj is 
exponentially distributed with parameter —9j for each j £ J. 

Proof. This result is immediate from Theorem 5.3 upon applying the 
linear transformation (ABA')" 1 = 2T" 1 to transform W into Q. □ 
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Under the assumptions of Corollary 5.1, from Theorem 5.2, (48) and the 
definition of Q, we have that 

(61) N = diag(p)A'Q 

and it follows that the stationary distribution of N can be expressed as a 
linear combination of independent exponential random variables. Thus, re- 
source j has associated with it a dual random variable Qj, for j 6 J (here, 
the superscript of s signals that the random variable is associated with the 
stationary distribution). These dual variables are independent and expo- 
nentially distributed with parameters — 8j for j G J and under its stationary 
distribution, the ith component of N is proportional to the sum of the dual 
variables associated with the resources used by route i. This suggests the 
following simple approximation for the stationary distribution of the un- 
sealed network, that is, the flow-level model of Section 2. The stationary 
approximation is 

(62) N? « Pi^QjAji, 

where Qj,j € J, are independent and Qj is exponentially distributed with 
parameter Cj — J2i&AjiPi- We want to emphasize that (62) above is merely 
a formal approximation involving various implicit assumptions such as exis- 
tence of a stationary distribution for N and formal unraveling of the heavy 
traffic scaling and limit procedure. In particular, this involves an interchange 
of limits and the approximation in (62) involves errors that may be substan- 
tial (e.g., of order r). However, the following observation of Massoulie and 
Roberts [33], which yields an exact stationary distribution for an unsealed 
linear network, suggests that there is cause for optimism regarding the ap- 
proximation (62). 



Example 5.1. Consider a linear network with J resources, where the 
set of resources is J = {1,2, ... , J}. Let the set of routes be labeled I = 
{0, 1, 2, . . . , J}, where we use the symbol i = for the route {1,2,...,J} 
through every resource and, for i = 1,2, . . . , J, we use the symbol i for the 
route {i} through the single resource i. The local traffic Assumption 5.1 
thus holds and we assume a = 1 and Ki = 1 for all i El. Suppose that 
Cj = l,j = 1, 2, . . . , J, and that po + pj < 1, j = 1, 2, . . . , J. The stationary 
distribution for (Nq,Ni, . . . , Nj) is then given by [33] 

/roN , \ nj=i(i-po-Pj) ( J2 n i\ rr n, 
(63) 7r(no,ni,...,nj) = (1 _^ )J _ 1 ^ I lift > 
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where no, ■ •■ ,nj each run through the nonnegative integers. Summing this 
formula over no and using the negative binomial expansion gives the marginal 
distribution for (Ni, JVjj, . . . , iVj) as 



Thus, under the stationary distribution, N±,N2, ■ ■ ■ , iVj are independent and 
Ni is geometrically distributed with mean pi/(l — po — Pi)- This accords re- 
markably well with the approximation (62), under which Ni would be ap- 
proximated by an exponentially distributed random variable with the same 
mean. 

For this particular example, as observed by [33], the Markov chain N is 
reversible and, consequently, the stationary distribution ir for N is insensi- 
tive to the document size distributions, depending only on their means. (For 
a description of general system structures to which such insensitivity results 
apply, see [37] and references therein.) 

As a complement to the insensitivity result mentioned in the above ex- 
ample, we note that the stationary distribution result of Corollary 5.1 can 
be extended to the situation where the document sizes are finite mixtures 
of exponentials. Indeed, such a flow-level model may be realized by col- 
lapsing an extended (exponential) model. The extended model is obtained 
by splitting each route in the original model into finitely many "copies" so 
that each copy uses the same set of resources and has the same weights 
as the original route, but where each copy has its own Poisson arrival pro- 
cess and distinct exponential document size distribution. The relative arrival 
rates for the copies determine the proportions for the mixture of exponen- 
tial distributions associated with the original route. More precisely, consider 
an extended (exponential) flow-level model in which, for each i = 1,...,I, 
routes i(l),i(2), . . . ,i(Ki) identify finitely many identical routes (subsets of 
the resources in J) whose associated weights are also identical. In the fol- 
lowing, parameters and processes associated with this extended model will 
have a superscript of ' appended. In particular, N^ will denote the flow 
count process. 

Under the assumptions of Theorem 5.2 (heavy traffic, a = 1, local traffic 
condition and initial state space collapse) for the extended flow-level model, 
there is an SRBM approximation W' for the workload and an attendant 
approximation iV"' = A(W^) for the flow count process. Furthermore, there 
is an SRBM process of dual random variables. Under the additional 
assumptions of Corollary 5.1 (all of the weights are one and 9 j < for each 
j S J), the stationary distribution of is such that the components are 
independent and Qj is exponentially distributed with parameter — #j for 



(64) 
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each j G J. Thus, under the assumptions of Corollary 5.1, the stationary 
distribution of 

(65) iVt = diag( / 9 t )(^ t ) / Q t 

can be expressed as a linear combination of exponential random variables. 
For the extended model, the Poisson arrival rate for route i(k) is 

and the exponential service time parameter for this route is p\/ k y Then, the 
I-dimensional collapsed process N defined by 

Ki 

(66) N i = T, N i(k)> i = l,-,I, 

fe=i 

has the same distribution as the flow count process in a flow-level model 
with I routes, where the Poisson arrival rate for route i = 1, . . . , I is 

( 67 ) = E u i(k) 

k=i 

and the document size distribution for route i = 1, . .. ,1 is a mixture of 
exponentials with parameters and proportions v \^l v i f° r k = l,...,Ki. 
The mean of this document size distribution is , where pi is defined by 

i ^4*) 1 



( 68 ) - = E 



^ fc=i Vi Aw 

for i = 1, . . . , I. Let pi = v%j ili for £ = 1, ... , I. 

Now, consider the exponential analog of the collapsed flow-level model 
where the finite mixture of exponential distributions for route % is replaced 
by a single exponential distribution with parameter //j for % = 1, . .. ,1. The 
nominal load placed on resource j in the extended model is (A*p')j and 
this is the same as the nominal load (Ap)j placed on j in the exponential 
analog. It follows that a sequence of extended flow-level models satisfying 
the assumptions of Corollary 5.1, where the limiting value in (12) of the 
heavy traffic Assumption 3.1 is denoted by 9\ will have a parallel sequence 
of exponential analogs whose limiting value 9 in (12) will be precisely the 
same as 9* . Consequently, the stationary distribution of is the same as 
that for the process Q of dual random variables associated with the sequence 
of exponential analogs. 

Then, combining (65) and (66), we obtain the following as an approxima- 
tion for the collapsed flow count process: 

Ni = E P\ k) E A Uk)Q) = Pi E A A for i = 1, . . . , I, 

k=i jeJ jeJ 
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that is, N = diag(p)A'Q^ . Since the stationary distributions for and Q 
are the same, it follows that the stationary distribution for N is the same, 
whether one considers the collapsed flow-level model (where document sizes 
are distributed as finite mixtures of exponentials) or the associated exponen- 
tial analog (where document sizes are distributed as exponentials). Similarly, 
the formal approximation (62) for the unsealed network is the same for both. 
Note, however, that the stochastic processes W', N and Q ' , and, in particu- 
lar, their covariance matrices are, in general, different from those associated 
with the exponential analog. 

It is natural to conjecture an extension of the above insensitivity results 
to general document size distributions. However, even an extension to phase- 
type distributions would require generalization of the diffusion approxima- 
tion results to flow-level models with feedback and treatment of more general 
distributions would appear to require a significantly more elaborate stochas- 
tic model (see [14]) to keep track of residual document sizes. 

5.5. Multi-path routing. In this subsection, we describe a generalization 
of the earlier model that allows multi-path routing. In our initial description 
of the model, we shall use a different notation for the sets of routes and 
resources. This will allow our eventual results to be expressed in a notation 
consistent with that used elsewhere in the paper. 

Suppose that we now interpret i G I as a source-destination pair and 
let K be the set of routes. Let I = |I| and let K = |K|. We suppose that 
K is partitioned into I nonempty subsets, each associated with a single 
source-destination pair. Let = 1 if route k G K is associated with source- 
destination pair i G I and let = otherwise. Thus, H is an I x K matrix 
containing only zeros and ones, and J2i£i Hik = 1 for each k G K. 

There remain finitely many resources, but now labeled by I G L, with 
capacities (C 1 / : I G L) that are all strictly positive and finite. A route k is 
a nonempty subset of L (interpreted as the set of resources used by route 
k). We assume that L and IK are both nonempty and finite. Let L = [L|, 
the total number of resources. Let A be the L x K matrix containing only 
zeros and ones, defined such that An- = 1 if resource I is used by route k and 
Aik = otherwise. Our assumption that each route k identifies a nonempty 
subset of L implies that no column of A is identically zero. 

It is assumed that a new document arrives to source-destination pair i at 
each jump time of a Poisson process that has rate parameter Vi > and that 
each such document has an exponentially distributed size with mean 
where /ij G (0,oo). These document sizes are assumed to be independent of 
one another and to be independent of all arrival times of documents. Let 
Pi = Vi/pi,i£ I. 

Given a fixed parameter a G (0, oo) and strictly positive weights (ki :i G I), 
if iVj(t) denotes the (random) number of documents being transferred be- 
tween source-destination pair i at time t for each i G I and N(t) = (Ni(t) : i G 
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I), then the bandwidth allocated to source-destination pair i at time t is 
given by Ai(N(t)) and this bandwidth is shared equally among the iVj(t) 
transfers in progress between source-destination pair i. The function A = 
(Aj :i G I) :M+ — >■ R\ is defined such that for each n G M+, Aj(n) = for 
i G \{n) = \jn G I : n m = 0}, and when I+(ra) = {m G I : ra m > 0} is nonempty, 
A + (n) = (Aj(n) : i G is the unique value of A + = (Aj : i G I+(n)) that 

solves the optimization problem 

maximize G n (A + ) 

subject to MkVk <Q, I Eh, 

, s k 

(69) 

^H ik y k = Aj, i€l+(n), 

k 

over 2/fc>0, G K, A, > 0, iGl + (n), 

where G n (A + ) is again given by the definition (2). 

Without loss of generality, we may assume that in the solution of (69), 
y k = for those k such that Hn~ = for all i G I+(n). With this convention, 
allocations yk,k G K, associated with the unique optimal value A + (n) can 
be interpreted as bandwidth allocations for the routes which sum to give 
the bandwidth allocations to the source-destination pairs. 

The above optimization problem reduces to the earlier problem (1) in 
the case where I = K and H is the identity matrix, that is, the case of a 
single route for each source-destination pair. More generally, the following 
proposition allows us to reduce to the optimization problem (1), even in the 
multi-path case. This observation was previously made in [24], Section 3.3. 

Let 

y = {yeRK-.Ay<C}. 
Proposition 5.1. There exists a representation 
(70) Hy = ((Ai, i€l):Ai>0, i G I, ^%Ai<C i; jejj, 

where J, A and C can be chosen so that C has positive elements, A has 
nonnegative elements and no column of A is identically zero. 

Proof. The set y is the intersection of the half-spaces {y G M K : (Ay)i < 
Q}, I G L, and the nonnegative orthant, R5\ Since no column of A is iden- 
tically zero, the set y is bounded and is thus the convex hull of a finite 
number of extreme points. Hence, Hy is the convex hull of a finite number 
of extreme points or, equivalently, the bounded intersection of a finite set of 
half-spaces. 
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Next, we explore further the geometry of Hy. First, Hy C since the 
elements of H are nonnegative. Also, G Hy since G y. Indeed, for d 
positive and small enough, (Aj = 6i, i G I) G Hy for all < 6i < 5, i G I, since 
for e positive and small enough, (y k = s k , k G K) G 3^ for all < e k < e, k G K, 
and no row of is identically zero. Thus, Hy is bounded by the hyperplanes 
bounding the nonnegative orthant, plus finitely many other hyperplanes, 
none of which contains the origin. Thus, there exists a representation of the 
form (70) for some choice of JT, A and C. Since G Hy, the elements of 
C are nonnegative. Choose a minimal representation, where the set J is of 
minimal cardinality. Then, the intersection of the hyperplane 

(71) {(A i; ieI):J2 A ji A i = Cj\ 

with Hy must have a nonempty interior relative to the hyperplane, for each 

Next, we show that if A G Hy and < A < A, then A G Hy. Let y G y 
be such that A = Hy and, for each k G K, let i(k) be the unique index i such 
that H ik = 1. Let y£ = (A' m /A i{k) )y k for fceK. Then, y' G 3> and Fy' = A, 
so A G Hy. 

We have seen that the intersection of the hyperplane (71) with Hy has 
a nonempty interior relative to the hyperplane. Choose a point A in this 
relative interior. Then, Aj > for i G I. Fix i G I and choose A' so that 
< A^ < Aj, with A<, = Aj/ for i' G I \ {i}. Then, < A < A, so A G 
Hence, the hyperplane (71) must have Aji > 0. 

Finally, note that Cj > for j G J since the hyperplane (71) does not con- 
tain the origin, and no column of A is identically zero since Hy is bounded. 
□ 

The represention (70) of Hy allows us to elide the variables y from the 
optimization definition of A + (n) and to deduce that the unique solution 
(Aj :i G L|_(n)) to the optimization problem (69) is also the unique solution 
to the optimization problem (1). With this generalization to multi-path rout- 
ing, the matrix A may no longer only contain zeros and ones. However, the 
proofs of Theorem 5.2 and Corollary 5.1 still go through in this enhanced 
generality, provided that the local traffic condition is satisfied. 

As an illustration, consider the network depicted in Figure 3, operating 
under the proportional fair sharing policy (a = l,Kj = 1 for all i), where 
the labeled resource capacities satisfy C3 < C^CV It has three source- 
destination pairs, four resources and five routes. The matrices A and C 
can then be chosen as 
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These matrices may be viewed as expressing the generalized cut constraints 



apparent from Figure 3. In the first of these constraints, the capacities of 
the first and second resources are pooled. Resource pooling subject to gen- 
eralized cut constraints is a common phenomenon in stochastic processing 
networks with routing [29]. Theorem 3.1 of [29] provides an alternative form 
of the representation (70) and provides references to algorithms to calculate 
matrices A and C . 

In the above example, we have been able to construct the matrix A so 
that a subset of its columns forms the identity matrix and thus the local 
traffic Assumption 5.1 holds. However, in general, it is more difficult to 
verify the local traffic condition for networks with multi-path routing than 
for networks without routing choices. 

Suppose that the local traffic Assumption 5.1 holds for the matrix A. 
Define heavy traffic as in Assumption 3.1 and assume that 6j < for j £ J. 
Corollary 5.1 then applies directly. We can illustrate the formal approxi- 
mation (62) for the example shown in Figure 3: under the approximation, 
Qi and Q|; the random dual variables associated with constraints (72) and 
(73), respectively, are independent and exponentially distributed, Q\ with 
parameter C\ + C2 — p\ — P2 and Q2 with parameter C3 — \p\ — p 3 . 



(72) 
(73) 



Pi + P2 < Ci + C 2 

\P\ + P3<C 3 




Ml 



Fig. 3. A network with three source-destination pairs, four resources and five routes. 
Each of the first two source-destination pairs has two routes available to it. 
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5.6. Discussion of the conjecture for a^l. Consider the two-resource 
linear network depicted in Figure 1 , operating under a weighted a-fair band- 
width sharing policy for a / 1. Then, the workload cone W a is still a wedge 
in that has the following representation: 



(74) W a 



w 2 



^2 1/a , V3 , , _U/ a 



The two boundary faces of this wedge have the following expressions: 



and 




: w?i > 0, ^2 



: W2 > 0, 



1 + 2$2(5!)% 

H 3 V 3 V «2 / 



1 + 



^3^3 



1/a 



W-2 



In this case, since the two-dimensional wedge W a is still a polyhedral cone, 
the proof of Theorem 5.2 can be easily modified to apply for a/1, with 
W a used in place of W\ there. When the weights i = 1, 2, 3, are all equal, 
the wedge W a does not depend on a. However, when the not all 

equal, the wedge W a does depend on a. However, as a goes to infinity, the 
quotients involving the weights Kj in the expressions for W a ,j = 1,2, tend 
to one and there is a limiting wedge which is the same as that obtained 
when the all equal. On the other hand, as a tends to zero, the limit 

of the upper boundary depends on whether k 3 > k,2 (tends to the vertical 
axis) or k% < K2 (tends to the 45 degree ray from the origin). Similarly, for 
the lower boundary, if k 3 > K\, it tends to the horizontal axis and if k 3 < K\, 
it tends to the 45 degree ray from the origin. Hence, as a tends to zero, 
if ft 3 > max{fti, K2}, the wedge expands to the whole quadrant and if K3 < 
minj/ti, K2}, the wedge contracts to the 45 degree ray from the origin. Note 
that even when the wedge expands to the whole quadrant, the components of 
the diffusion workload do not become independent as the covariance matrix 
r, which does not depend on k or a, is not diagonal. 

In general, for o/l with higher-dimensional workloads, that is, J > 2, 
the shape of the workload cone W a depends on a, as well as on A, is, \i and 
k. This relationship appears to be quite complicated. We are investigating 
Conjecture 5.1 in this case. At this time, we only have some partial results. 
The main difficulty in establishing the validity of the conjecture concerns 
proving C-tightness of {H^} and establishing uniqueness in law for any 
weak limit point. To illustrate some of the difficulties involved, we consider 
a linear network with three resources and four routes. Although this is a 
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Fig. 4. A portion of the workload cone Wi is shown for a linear network with three 
resources and four routes with a = 1 and Ui = fj,i = m = 1 for all i 6 1. 

particular example, it exemplifies the challenges presented by the geometric 
data in establishing existence, uniqueness and an invariance principle for 
the SRBM. For this example, each of the three resources has a route that 
only goes through that resource and there is one additional route that goes 
through all three resources. Here, the workload cone is contained in the 
three-dimensional positive orthant. When = fj,i = k% = 1, for all igl, we 
depict the associated workload cone and one of its cross-sections, for a = 2, 
a = 1 and a = 0.5, in Figures 4-9. 

For the case depicted in Figures 4-5, a = 1 and Theorem 5.2 applies 
to justify the diffusion approximation. The proof of this theorem uses the 
uniqueness of the diffusion process [10] and the invariance principle devel- 
oped in [20]. For the case depicted in Figures 6-7, a = 2 and the workload 
cone has boundary faces that curve outward. Away from the origin, the 
boundary and directions of reflection for the proposed diffusion approxima- 
tion locally satisfy conditions required by the invariance principle given in 
[20]. However, the workload cone has a "singular point" at the origin where 
the conditions in [20] fail to be satisfied. However, since this is an isolated 
point, the invariance principle in [20] and the proof of uniqueness starting 
in [10] can be adapted to establish the SRBM diffusion approximation in 
this case. For higher-dimensional analogs of this case, we anticipate that a 
valid diffusion approximation can be established. However, as the dimen- 
sion increases, it becomes more difficult to compute the inward normals to 
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all boundary faces and, as yet, we do not have a systematic way to verify 
geometric sufficient conditions for uniqueness in law and a valid invariance 
principle. For the case depicted in Figures 8-9, where a = 0.5, the situation 
is more complex. Here, the workload cone has boundary faces that curve 
inward and any two faces meet in a cusp, that is, the inward normals to 
any two boundary faces point toward one another at the intersection of the 
faces. At present, we do not have an existence and uniqueness result, nor an 
invariance principle, to treat this case because of the singular geometry at 
the intersections of faces. In particular, it has not been established that the 
diffusion process can escape from the tip of a cusp or that there is uniqueness 
in law for a diffusion process starting there. 

In a recent work, Shah and Wischik [38] have proven multiplicative state 
space collapse for a class of "switched" networks operating under a family 
of scheduling policies related to the maximum weight algorithm introduced 
by Tassiulas and Ephremides [40]. Based on this, Shah and Wischik have 
conjectured natural diffusion approximations for the workload processes in 
these models. These diffusions are SRBMs living in cones with piecewise 
smooth boundaries. While these cones have some characteristics in common 
with those found for the bandwidth sharing model considered here, there 
are also some new features. Depending on a parameter associated with the 
family of scheduling policies, the cones of [38] include nonsimple convex 
polyhedrons as well as cones with curved boundaries where boundary faces 
can meet smoothly. The validity of these conjectured diffusion approxima- 
tions for input-queued packet switches operating under a maximum weight 
algorithm is being explored in [21]. 




Fig. 5. A cross-section of the workload cone Wi depicted in Figure 4 taken at u>3 = 1. 
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Fig. 6. A portion of the workload cone W2 is shown for a linear network with three 
resources and four routes with a = 2 and Vi = fii = m = 1 for all i 6 1. 

6. Proof of multiplicative state space collapse. In this section, we prove 
the multiplicative state space collapse result, Theorem 5.1. Our proof fol- 
lows a general line of argument pioneered by Bramson in [5], where open 
multiclass queueing networks operating under certain head-of-the-line (HL) 




Fig. 7. A cross-section of the workload cone W2 depicted in Figure 6 taken at 103 = 1. 
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Fig. 8. A portion of the workload cone Wo. 5 is shown for a linear network with three 
resources and four routes with a = 0.5 and Vi = = m = 1 for all i £ I. 

service disciplines are treated. However, there are some differences from [5]. 
Here, we have the more general structure of a stochastic processing net- 
work with simultaneous resource possession and our service discipline is not 
work-conserving. On the other hand, we have exponential interarrival and 
document size distributions (rather than general distributions), which lead 




Fig. 9. A cross-section of the workload cone Wo. 5 depicted in Figure 8 taken atW3 — l. 
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to some simplifications in our proofs. A particularly interesting aspect here 
is that, in contrast to prior results on state space collapse for open multi- 
class queueing networks, our lifting map can be nonlinear (for a/1). In 
addition, unlike Assumption 3.1 in [5], we do not require an exponential 
rate of convergence of fluid model solutions toward points on the invariant 
manifold; we only use uniform convergence of fluid model solutions toward 
the invariant manifold (starting in a compact set). Despite these differences, 
our main line of argument follows that of Bramson [5] . 

We first provide some preliminary results in Section 6.1. Our proof of 
multiplicative state space collapse is then given in Section 6.2. 

We note (for the extension to multi-path routing described in Section 5.5) 
that the proofs in this and the next section use the fact that the entries in 
the matrix A are nonnegative, rather than the stronger condition that they 
are zeros or ones. 



6.1. Crucial estimates for fluid scaled processes. Recall the definition of 
fluid scaled processes from Section 3. For each r > and m E Z + , let 

(75) n r m =\N r (m)\Vl 

and define shifted and scaled processes 'E r ' m ^j^' m ^Jf'" 2 ^ Jf r ' m ^ W r,m as 
follows. For i G I, j € J and t > 0, 

W i (m + n r m t)-E r i (m) 



(76) Era) 

(77) ST(i) 

(78) T r r(t) 

(79) TTf{£) 

(80) W n (t) 



(81) W r - m (t) 



^•(T[(m)+r4t)-S[(T[(m)) ^ 

n rn 

T r i {m + n r m t)-T r i (m) 
U^m + rf^-U'jjm) 

— r ' 

Nljm + n^t) 
— r ) 



The additional scaling by rf m used here is to ensure that the starting values 
of N ' all lie in a single compact set, namely the unit ball in M^_. This 
facilitates use of the properties of fluid model solutions described in Section 
4. It is easy to check using (5)-(6), (13)-(15) and the scaling property of A 
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that for each i G I, j G J and t > 0, 

(82) N r - m (t) =N r - m (0) + E r ;' m (t)-Sr(r. m (t)), 

(83) ir;' m (t)=c,t-5:^T:' m (t), 

(84) %' m (t)= / < A,(iV r ' m ( S ))d S . 

The following theorem summarizes essential properties of the above pro- 
cesses needed for our proof of multiplicative state space collapse. The proof 
of this uses arguments very similar to those in Sections 4, 5 and 6 of Bramson 
[5]. 

Theorem 6.1. Let 

(85) K = (1 + V + maxCjj . 

Fix T > 0. For each L>1, there exists a sequence of measurable sets {Q r L : r > 
0} and a family of positive constants {r £ ^:e E (0,1)} such that for each 
e € (0, 1) and r > r £ ^, 

(i) P{Q r L )>l-e; 

(ii) for each ui £Gl an d m = 0,1, . . . , \ rT\ , 

(86) \N r ' m (t,uj)-W ,m (s,Lu)\<K\t-s\+e for all s,t e[0,L] 
and there is a fluid model solution n(-) satisfying 

(87) ||7v r ' m (-,u;)-n(-)IU<£. 



Note that we have not indicated explicit dependence on T in the above 
as T will always be fixed in the application of this result. Also, to simplify 
notation, we have omitted explicit indication of the dependence of n(-) on 
r, m, uj, L and e. 

Before proving this theorem, we establish the following preliminary lemma. 
For this lemma and the proof of Theorem 6.1, let C = maxjgj Cj. 

Lemma 6.1. Fix T > 0. For each L>1, there exists a sequence of mea- 
surable sets {Q r L :r > 0} and a family of positive constants {r Et L :e € (0, 1)} 
such that for each e € (0, 1) and r > r e L> we have 

(88) P{ Ql)>i- £ 
and on £/£, for m = 0, 1, . . . , [rT\ , 

(89) ||r' m (-)-^(-)IU<e/4, 

(90) p r ' m (T r ' m (.)) - //(r' m (-))||L < e/4, 

where u r (t) = u r t, (5 r,m (T r ' m (t))) i = S^ m (T-'™ {t)) and (// (T r,m '(*)))< = 
H^Ti (t), for all t>0 and i G I. 
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Proof. Fix L > 1. Note that since the bandwidth allocations given by 
A(-) are bounded by C, for each r > and i 6 I, 37(0 * s Lipschitz contin- 
uous with Lipschitz constant C . Since this property is unchanged by the 
(fluid) scaling in (78), we have that T^' m (-) is also Lipschitz continuous with 
the same Lipschitz constant. It follows from this that ||T r ' m (-) ||^ < ICL. On 
combining this with Assumption 3.1, the fact that the interarrival times and 
document sizes are exponential and the memoryless property of the asso- 
ciated Poisson processes, by an argument similar to that used in verifying 
(5.19) of Proposition 5.1 in Bramson [5], we have that as r — ► oo, 

(91) ffi(nr' m (r- m (-)) -//(r- m (-))iu v n^' m (.) - =>o. 

m=0 

It follows that there exists a sequence {ag}fl satisfying ao = 0, ag E {t V 
a^_i, oo) for each integer I > 1, such that for each r >a,£, 

LrTJ r _ _ i 

fl |||5 r ' m (r r ' m (.)) - *r<F' m {-))h v p^O) - ^(OIU < ^- +2 



m=0 

For r > 0, define 



oo 2 

(5 ( r )=E^I 1 K^ + i)( r ) 

1=0 z 



and 

LrTj 



oi= n (ii^ m r' m (o)-^(T r ' m (o)iiLViir-(o-^(oiiL<^). 

m=0 ^ J 

Then, since 5(r) — > as r — > oo, for each e € (0, 1), there exists r £i L > such 
that 5(r) <e for all r > r e £. Combining the above, we conclude that for all 

(92) p(g r L ) > 1 - «y(r) > 1 - £. □ 



Proof of Theorem 6.1. Fix T > and L > 1. Also, fix a sequence 
of measurable sets {<?£ : r > 0} and a family of positive constants {r £: i ■ £ € 
(0, 1)}, as in Lemma 6.1. Without loss of generality, by Assumption 3.1, we 
may further assume (by choosing r £ ^ slightly larger if necessary) that for 
all r > r E) L, 

|^ r |<l+M and \fj, r | < 1 + 
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By Lemma 6.1, the uniform Lipschitz continuity of T ' (•) and (82), for 
each e G (0,1) and r > r e .L, we have that on Q r L , for each s,f£ [0,L] and 
m = 0,l,..., [rT\, 

(93) \E''' m {t)-E r ' m {s)\ < ! + |i/ r ||t - s|, 

(94) |5 r ' m (T r,m (t)) - 5 r,m (r- m ( S ))| < | + - S |, 

(95) {W^it) -W' m (s)\ < e + (K| V |//|)(1 +C)|t - s\. 

Then, part (i) of Theorem 6.1 follows directly from Lemma 6.1. Inequality 
(86) follows from Lemma 6.1, (95) and the choice of r s ^ and K. As for the 
last part, (87), of Theorem 6.1, for a proof by contradiction, suppose that 
there exists an eq G (0, 1) such that for each integer i > r e0t L, there exists a 
value ri of r such that > £, and such that there exists mi G {0, 1, . . . , [r^T\ } 
and u>£ G Q r ^ so that for any fluid model solution n we have 

(96) |P^™(^)-n(.)||L>eo. 

For the contradiction, we will show that there exists a subsequence of {N re ' 1 (•, 
u>e)}^l 1 that converges uniformly on [0,L] to a fluid model solution. For 
this, note that {^'"^(0,^)}^ is bounded by one, and \T l ' mi [ {-,u i )}^ 1 
is uniformly bounded and equicontinuous on [0, L], by the uniform Lips- 
chitz continuity of T ' . Thus, using the Ascoli-Arzela theorem, we have 
that along a subsequence, {iV e ' ^(O,^)}^ converges to a finite value and 
{T re ' me (■,lu^)}'^ 1 converges uniformly on [0,L]. Fix such a convergent sub- 
sequence and denote the respective limits by n(0) and r(-). Then, using the 
convergence of {(E ri ' me (-,u e ),S ri ' mi (T ri ' m < implied by (89) 

and (90), we have that along the same subsequence, {N Ve ' me (■ , u)i)}fl 1 con- 
verges uniformly on [0,L] to n(-), defined by 

(97) n i (t) = n l (0)+v i t-fi i T i (t), te[0,L],ieL 

Furthermore, by following the pathwise argument in Appendix B of Kelly 
and Williams [26], one can show that at each t G (0,L] where the derivative 
of n(-) exists (this means the left-hand derivative at t = L), we have that 
(22) and (23) hold with n in place of n there. It follows that n(-) has the 
same properties as a fluid model solution on [0, L\ . We can easily extend n to 
be a fluid model solution on [0, oo) by defining it on (L, oo) to equal /(• — L), 
where / : [0, oo) — > is a fluid model solution satisfying /(0) = n(L). (The 
existence of such a fluid model solution follows from Theorem B.l in [26].) 
This yields the desired contradiction of (96). □ 
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6.2. Proof of multiplicative state space collapse. We will use Theorem 
6.1 to show that for T > fixed, for each L > 1, there exists L > L such that 
for each e G (0, 1), for all r sufficiently large, with probability at least 1 — e, 
the left member of (33) is dominated by 

(98) i|]v r ' (.) - A(Tr r '°(.))ii [0 ,L] + ^p 1 \W ,m i-) - A(Tr' m (-))ii [I)L] , 

where the quantities {N r ' m , m = 0, 1, . . . , \rT\ — 1} can be approximated by 
fluid model solutions over [0, L\. The results of Section 4 on the behavior 
of fluid model solutions, especially Proposition 4.4, can then be combined 
with a suitable choice of L and the assumptions of Theorem 5.1 on the 
initial conditions to prove multiplicative state space collapse. We now give 
the detailed proof. 

Proof of Theorem 5.1. Fix T > 0. For each r > 0, let H r T denote the 
left member of (33). Since, for each r > and t > 0, we have 

(99) N r (t) = N r (rt), W r (t)=W r {rt) = A(M r )- 1 N r {rt), 
it follows that 

ii]v r (.)-A(Tr(.))ii r T 



(ioo) n\ 



\W(-)\\ rT Vl 



For L>1 fixed, intervals of the form [m, m + Ln^J, m = 0, 1, . . . , \ rT\ , cover 
[0, rT] since > 1 for each m £ Z + . Hence, for each r > and t € [0, rT], the 
following is well defined as a random variable taking values in {0, 1, . . . , [rT\ }: 

(101) m r (t) =inf{m:m < t < m + Lrf m }. 

For any r > and t € [0, rT] such that t > Ln^, we have m r {t) > 1 and then 

(102) t-{m r {t)-\)>Bf? mr{t) _ x 

since, otherwise, m r (t) could be replaced by a value of m strictly less then 
m r (t). Thus, for each r > and t € [0,rT] such that t > Ln^, we have 

(103) m r {t) - 1 + Ln r mr(t) _ 1 < t < m r (t) +Ln r mr(t) , 

where m r (t) G {1, 2, . . . , [rT\ }. From (100) and (103), we have that H r T is 
dominated by 

\\W(.) - A(Tr(.))ll[o,Z %] [rTJ 11^(0 - A(TT (.))ll [m _ 1+ i^ +z j 
+ sup — 

n n '"=1 n m-1 



(104) =\\N r '°(-)-A(W r '\- 



[0,L] 



+ S pr^o - A(Tr^ 1 (.))ii [ i l(1+ x^ ) /^_ l]) 



4G 



KANG, KELLY, LEE AND WILLIAMS 



where we have used the scaling property of A(-) in obtaining the equality 
(see Proposition 4.1). 

Recall the definition of the constant K > 1 from (85). Given L > 1, let 
L = 4KL. Below, we refer to the sequence of measurable sets {Q r L : r > 0} 
introduced in Theorem 6.1. Focusing on the right endpoints of the time 
intervals appearing in the last term of (104), we next show that, for all r 
sufficiently large, on Q r L , for m = 1, . . . , \rT\ , we have 



(105) 



1 + L7 4 <i, 

n m-l 



For this, note that, by (80), we have, for m = 1, 2, . . . . 



N r {m) _N r {m-l + l) 



N 



r,m— 1 



(106) 

n m-l n rn-l 

On the other hand, by (75), for m = 1, 2, . . . , 

7V r (m-l) 



m—l 



(107) 



\N r ' m - l m 



n 



m—l 



< 1. 



It follows from (106), (107) and Theorem 6.1 that for each e 6 (0, 1), for each 
r ^ r e,L an d m = 1, . . . , \ rT\ , on Q r L , we have 



|iV r '(m)| 



V 



m—l 



m—l 



n 



m—l 



(108) 



< 



< 



N 



\m— 1 



n 



m—l 



V 1 



■n 



m—l 



+ e VI 



<(l + J fC-l + l)Vl 
< 3K 

and hence (105) holds. On combining (104) with (105), we have that for 
each e £ (0, 1), for all r > r £) ^, the following inequality holds on Q r L : 



(109) 



^<||iY r ' (.)-A(^ r ' (-))ll [0 ,L] 

+ L s T u J p 1 ||^(.)- A(^(.))|| [Iir 

m=0 ' 



We will now estimate the two terms on the right-hand side of (109). In 
brief, the idea is to use Theorem 6.1 to show that for all r sufficiently large, 
for each oj E (7£, m = 0, 1, . . . , [rT\ , there is a fluid model solution h that 
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is uniformly close to N ' (-,w) over the time interval [0, L]. For the first 
term in (109), we then use the assumptions on the initial conditions and 
Proposition 4.5 to show that for all r sufficiently large, with high probability, 
h is uniformly close to the invariant manifold A4 a on [0, oo) and use this to 
control the first term in (109). For the last term in (109), we use Proposition 
4.4 to choose L (independent of r,u or m) so that h is uniformly close to 
the invariant manifold M a on [L, oo) and so, for all r sufficiently large, 
uj € Q r L and m = 0, 1, . . . , \rT\ — 1, for each t G [L, L], there exists a point n£ 
(depending on r,iv,m,t) in M a that is uniformly close to N ' (t, u). Then, 

|7V r * m (t,c;)-A(Tr' m (t,u;))| 

< \N r ' m (t,Lu) - h* t \ + \h$ - A(AM~ x h$)\ 
+ lAiAM- 1 ^*) - AiAiM^^W^it,^))], 

where each of the last three terms can be made small (uniformly for u> € 
£/£, m = 0, 1, . . . , \ rT\ — 1, t G \L, L), for all r sufficiently large), by the choice 
of hi, the fact that n = A(w(n)) for a point n on the invariant manifold 
M a , the continuity of A and the convergence of (M r ) _1 to Af _1 as r — > oo. 
We now give the full details of the argument. In the first paragraph below, 
we develop estimates that will be used for both the first and the second term 
on the right-hand side of (109). (Accordingly, we consider values of m that 
include [^^J , as this value may be zero.) 

Fix i] 6 (0,1). [In this proof only, we locally reuse the symbol r\ for a 
positive constant in (0,1). This is distinct from the use of r) elsewhere as 
a probability measure.] By Proposition 4.3, there is a constant D(9/8) > 
9/8 such that any fluid model solution n(-) satisfying |n(0)| < 9/8 satisfies 
\ n (t) \ < D(9/8) for all t > 0. By the uniform continuity of A(-) on compact 
sets in (see Proposition 4.1) and the fact that (M r ) _1 — > M _1 as r — > oo 
(see Assumption 3.1), there are constants r v > and 7 S (0,?7/4) such that 
for all r >r v , 

(110) |A(AM- 1 n)-A(A(M r )- 1 n')|<- 

8 

whenever n,n' £ |n| V \n'\ < 1 + D(9/8) and \n — n'\ < 27. By Proposi- 
tion 4.5, there exists e € (0,7/2) such that if n(-) is a fluid model solution 
satisfying |n(0)| < 9/8 and d(n(0), M a ) < 2e, then 

(111) d(n(t),M a ) <7 foralH>0. 

Let L = Tg/g j£ , where Tr j£ is defined in Proposition 4.4. Set L = AKL 
as above. Then, by Theorem 6.1, for each r > r e ^, m = 0, 1, . . . , |_ r ^J an d 
w € G T L , there exists a fluid model solution n(-) such that 

(112) ||F r ' m (.,u,)-n(.)||L<£. 
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We fix such r, m and ui for the remainder of this paragraph. Since |iV r ' m (0, uj)\ < 
1 and e < 1/8, we have |n(0)| < 9/8. It then follows from Proposition 4.4 and 
the choice of L that 

(113) d(h(t),M a ) <£ foralU>I. 
For each t G let n* t G A4 a such that 

(114) \n{t)-h* t \<e. 

On combining this with (112), we obtain, for each t G [L,L], 

(115) \N r,m {t,uj)-h* t \ <2e< 7 . 



(116) 



By Proposition 4.3 and the fact that |ra(0)| < 9/8, we have \n(t)\ < D(9/8) 
for all t > 0. So, by (114) and (112), for each t G [L,L], 

\n* t \ <e + D(9/8) < l + D(9/8) 

and 

|iV r ' m (i,cj)| <e + ,D(9/8) < l + D(9/8). 

On combining the above and using the fact that W ' = A(Ar) -i JV , 
we have, for each r > r e ^ Vr,, m = 0, 1, . . . , [r^Tj — 1, u G £/£ and t G [L, L] , 

\N r ' m (t,uj) - A{W ,m (t,uj))\ 

< \N r ' m (t,uj) - n*| + \n* t - &{AM~ l h*)\ 

+ lA^Af" 1 ™*) - A(A(M r )" 1 iV r ' m (i,w))| 

77 77 

<2e + + |<|. 

For the second inequality, we have used (115), part (iv) of Theorem 4.1 and 
(110). This takes care of estimating the last term in (109). 

To estimate the first term on the right-hand side of (109), we need to use 
the initial behavior of N . By the assumption in the theorem, 

(117) |7V r,0 (0) - A(W ,O (0))| = |iV r (0) - A(# r (0))| 

in probability as r — > oo. By Proposition 4.2, we have A(W T '°(0)) G Ai a and 
so it follows that 

(118) d(N r,0 (0), M a ) -» in probability as r ^ oo. 
Let r £ > such that 

(119) P(d(N r '°(0),M a )>e)<e for all r>r £ . 
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For each r>r £ Vr £j L and u G Q r L satisfying d(N r ' (0,uj),M a ) < e, by (112), 
we have 

(120) |n(0)|<l + e<f and d(n(0), M a ) < e + e = 2e. 

For such r, it follows from (111) that for each t G [0, oo), there exists 
h* G Al a such that \h(t) -n* t \ < 7. Hence, by (112), for t G [0,L] C [0,L], we 
have 

(121) \N r '\t,u)-n* t \<e + 1 <^-. 

Then, in a manner similar to that used in showing (116), we have, for all 
t G [0,L], r > r £ V r £>L V r,, and w G <5£ satisfying d(iV ' (0, w), M a ) < e, 

(122) |7V r '°(t, u) - A{W r '°(t, w) )|<( fi + 7 ) + 0+ 2<2. 

By combining Theorem 6.1, (109), (116), (119) and (122), we have that, 
for all r > r £ V r £ ^ V r„, 

P(H r T > V )< p({Ht >rtng r L n {d(N r '°(0),M a ) < e}) 

(123) + P{(g r L ) c ) + P(d(N r '°(0),M a ) > e) 

<0 + £ + e = 2e < ?. 

4 

Since G (0, 1) was arbitrary, it follows that Tt T T — > in probability as 
r — ► oo. □ 

7. Proof of diffusion approximation when a = 1. Throughout this sec- 
tion, we assume that a = l and that the local traffic condition, Assumption 
5.1, holds. To prove the diffusion approximation result, Theorem 5.2, we 
shall use the invariance principle in [20]. A key assumption for that theory 
to yield convergence (rather than just C-tightness) is that there is existence 
and uniqueness in law for the limit diffusion process W, which follows in 
the case a = 1 from work of Dai and Williams [10]. We first verify that the 
basic assumptions of [20] and [10] are satisfied by the state space Wi [see 
(47)] and directions of reflection {7 J :j G JT}. For this, we need the following 
definition. Also, recall the definition of the matrix B following (47). 

Definition 7.1. A dxd matrix D is completely-S if and only if, for each 
principal submatrix D of D, there exists a vector x > such that Dx > 0. 
(Here, a principal submatrix of D is a matrix obtained by deleting all rows 
and columns of D with indices in some strict subset of {1,2, ... ,d}.) 

Lemma 7.1. The J x J symmetric matrix ABA' is positive definite and 
invertible. The inverse matrix (ABA')" 1 is positive definite and completely- 
S. 
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Proof. Since A has full row rank and B is strictly positive definite, 
the symmetric matrix ABA' is also strictly positive definite. Indeed, for 
a nonzero vector x G M J , x'ABA'x = (A'x)' 'B(A'x). Since A has full row 
rank, A'x is also nonzero and then, since B is strictly positive definite, we 
have that (A'x)' B(A'x) > 0. Since ABA' is strictly positive definite, it is 
invertible and the inverse matrix (ABA') -1 is also strictly positive definite. 
It then follows that (ABA') -1 is a P-matrix, that is, all principal minors are 
positive, and hence (ABA') -1 is completely-S (see [9], especially Theorems 
3.3.7, 3.9.11 and Corollary 3.9.13 for the relationship between P-matrices 
and completely- S matrices). □ 

The results we need to apply from [20] to prove Theorem 5.2 require that 
Assumptions (Al)-(A5) and 5.1 of [20] hold. We shall not fully describe 
these general conditions. However, we shall now indicate the meaning of 
these assumptions for the context treated here. Assumptions (Al)-(A3) are 
restrictions on the state space and Assumptions (A4)-(A5) pertain to the di- 
rections of reflection, all for the limit diffusion. As noted in [20], Assumptions 
(Al)-(A3) are satisfied by a convex polyhedron with nonempty interior that 
is described as the intersection of a minimal set of half-spaces. As we shall 
see in the next lemma, the state space Wi has this convex polyhedral form. 
Assumption (A4) requires that the reflection vector field on each boundary 
face be uniformly Lipschitz continuous and of unit length. Since our reflec- 
tion vectors are constant unit vectors on each boundary face, this condition 
is trivially satisfied in our context. Assumption (A5) imposes geometric con- 
ditions on the directions of reflection. These conditions are generalizations 
for domains with piecewise smooth boundaries of conditions identified ear- 
lier by Dai and Williams [10] for existence and uniqueness of SRBMs living 
in convex polyhedral domains with a constant direction of reflection on each 
boundary face. The conditions of [10] are labeled as Assumption 1.1 in [10] 
and as Assumption 5.1 in [20]. (We shall use the latter label here.) For sim- 
ple convex polyhedral cones (as we have here with Wi), this assumption can 
be expressed as a completely- S condition on a suitable matrix formed using 
the directions of reflection and the normals to the boundary faces. We will 
show that this condition is satisfied in our context, and, as a consequence, 
Assumption (A5) of [20] will follow immediately. We now formally verify 
that the aforementioned assumptions all hold in our context. 

Lemma 7.2. Assumptions (Al)-(A5) and Assumption 5.1 of [20] are 
all satisfied by Wi and {j 3 : j £ J}. 

Proof. For j € J, let n 1 be the vector given by the jth row of (ABA') -1 . 
Since (ABA') -1 is symmetric, n 1 is also the jth column of (ABA') -1 . By 
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(47) and (45), we have 

(124) W! = {w £R 3 :n j -w>0 for all j £ J} 
and 

(125) W( ={ABA'q:q£R 3 + ,q j = 0} = {weW 1 :n j -w = 0}, j £ J. 

Since the n 3 ,j £ J, are linearly independent, it follows that Wi is a simple 
convex polyhedral cone with minimal representation given by the intersec- 
tion of the half-spaces {w £ M J • w > 0}, j € J. It is easy to see that 
Wi has nonempty interior. As noted in [20], it follows that Assumptions 
(Al)-(A3) of [20] are satisfied. Since the {7 J ,J £ J} are constant vectors, 
they are trivially uniformly Lipschitz continuous and so Assumption (A4) 
of [20] holds. As noted in Section 5.2 of [20], Assumption (A5) will hold if 
Assumption 5.1 of [20] holds. For this, since Wi is a simple convex polyhe- 
dron, it is equivalent to verify that the matrix NR is completely-S*, where 
N is the J x J matrix whose rows are given by the normals re- 7 , j £ J, and R 
is the matrix whose columns are given by the vectors 7 J , j £ J. (In fact, ./V 
is normalized so that the rows have unit length in [20], but one may equiva- 
lently use the unnormalized matrix N.) Now, N = (ABA 1 ) -1 and R = I, the 
J x J identity matrix, so NR = (ABA')" 1 , which we know is completely-S" 
by Lemma 7.1. □ 

Throughout this section, we will need various constants in inequalities. 

Some of these constants are denoted by Cj, for j = 0, 1, 2, The hat has 

been added to the notation here simply to distinguish these constants from 
the bandwidth capacities Cj . The next two lemmas provide basic ingredients 
for the proof of Theorem 5.2. 

Lemma 7.3. There exist constants C\ > 0, C 2 > 1 and f > such that 
for each T > 0, 

(126) C 1 \\W r (-)\\T<\\N r (-)\\ T <C 2 \\W r (-)\\T forallr>f. 

Proof. By Assumption 3.1, there exists f > such that \\A(M T )" 1 \\ < 
2||ylM _1 1| and \n r \ < 2\fi\ for all r > f. For each t > 0, by (17), we have that 

(127) \W r (t)\ < P(M r )- 1 |||iV r (*)|. 

Now, ||AM _1 || > and so by letting c\ = (2||ylM~ 1 H) -1 , we see that the 
left-hand inequality in (126) holds for any T > 0, for all r > f. On the other 
hand, for each i £ I, there is at least one ji £ J such that Aj^ > and then 
using (17) again, we obtain that, for each t > 0, 

AM^Nl^KW^t). 

By letting C2 = max(l, 2|^|Imaxjgi(^4j i j) _1 ), we have that the right-hand 
inequality in (126) holds for any T > and r > f. □ 
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Lemma 7.4. The sequence of processes {W r (0) + X r (-),r > 0} is C- 
tight. 

Proof. Recall that for each t > 0, 

(128) X T (t) = A{M r y l {E r {t) - S r (T r (t))) + r(Ap r - C)t. 

Since each route must use at least one resource, for each n £ and i £ 
I, the bandwidth allocation Aj(n) must be bounded by C = maxjgjCj. It 
follows that for each r > 0, T r (-) is uniformly Lipschitz continuous with 
Lipschitz constant C and hence {T r (-)} is C-tight. On combining this with 
the functional central limit assumption (21), the convergence of {r(Ap r — 
C)t, r > 0} and {M r , r > 0} and the convergence assumption on ^""(O) made 
at the end of Section 3, the desired result follows. □ 

With the above preliminaries in place, we are now ready to address the 
main part of the proof of Theorem 5.2. In the following, we shall reuse some 
notation for local proof purposes. In particular, £ and £ are used for different 
purposes here than earlier in the paper. 

Recall, from (36), that for each r > and t > 0, 

(129) W r (t) = W r (0)+X r (t) + U r (t) 
and that, by (17), 

(130) W r (t) = A(M r )~ 1 N r (t) = W r (t) + t(t), 
where 

(131) W r (t) = AM~ 1 A{W r (t)), 

(132) i r (t) = AM~ 1 (N r (t) - A{W r (t))) + v4((M r ) _1 - M~ 1 )N r (t). 
By Proposition 4.2 and the definition of Wi from Ai\, we have that 

(133) W r (t) e Wi for all * > and for all r > 0. 
Returning to (129), for 5 > fixed and each r > 0, t > 0, j 6 J, 

( 134 ) ^{t) = jT l m r [s)M) < 5} dUj(s) + (fit), 

where 

( 135 ) (?^ t )=J^\ d{ wr {s) ,w i) > s} d u;(s). 

A main step in establishing the diffusion approximation result, Theorem 
5.2, is to show that for each j € J, with probability tending to one as r — > oo, 
the process U r increases only when W r (or W r ) is near the boundary portion 
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W]'. The local traffic condition (Assumption 5.1) is used in verifying this in 
the following lemma, which also shows that state space collapse holds and 
that for fixed T > 0, with high probability, we can obtain a uniform bound 
on ||W r (-)||T and on ||C> r (-)||T. 

Lemma 7.5. Suppose, in addition to the assumption that a = 1 and 
Assumption 5.1, we have that 

(136) |iV r (0)-A(W r (0))H0 

in probability as r — > oo. Then, for each T > 0, 5 > 0, there exist constants 
K(T, 5) > and r(T, 5) > such that for each r > r(T, 5), 

P(\\N r (-)-A(W r (-))\\ T <5,\\i r (-)\\ T <6, 

(137) 

\\C 5 (-)\\t = 0, \\W r (-)\\ T < K(T,5), \\U r (-)\\ T < K(T,S)) > 1 - 5. 

A main aspect of the proof of Lemma 7.5 is to show that for fixed T > 0, 
with high probability, we can obtain a suitable uniform bound on ||A rr (-)||'r 
[or equivalently, on ||V^ r (-)||r] for all r sufficiently large. We shall use an 
oscillation inequality for the proof of this. A local version of this inequality 
was established in Theorem 4.1 of Kang and Williams [20]. For a/1, one 
would need to use that local version. However, for the case a = 1 treated 
here, one can choose p in Theorem 4.1 of [20] to be arbitrarily large and 
consequently obtain a global version of the oscillation inequality. Here, for 
the case of a = 1 only, rather than using Theorem 4.1 of [20], we shall give 
an alternative proof of a global oscillation inequality by first invoking a 
linear transformation to transform Wi to the orthant and then applying an 
oscillation inequality developed earlier by Williams [43] for that state space. 
This inequality has a slightly simpler form than that which would follow 
from [20]. 

For the statement of the oscillation inequality, we need the following no- 
tation. For any < s < t < oo, let D([s,i],lR J ) denote the set of functions 
x: [s,t] — >M J that are right-continuous on [s,t) and have finite left limits on 
(s,t\, and for x G B([s, t], M J ), let 

(138) Osc(x, [s,t]) = sup{\x(v) — x(u)\ :s <u <v <t}. 

Proposition 7.1 (Oscillation inequality). There exists a constant Co > 
such that for any 5 > and any < s < t < oo, w,x,y € B([s, t], M J ) satis- 
fying 

(i) w(u) =x(u) + J2jaVj( u h :! for all u£[s,t], 

(ii) w(u) G Wi for all u G [s, t], 

(iii) for each j G J, 
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(a) yj (s)>0; 

(b) yj is nondecreasing; 

(c) yj(u) = yj {s) + J {sM l Ku , (t , )iW ^)< 5} dyj(v) for all u G [*,<]. 
the following hold: 

(139) Osc(to,[s,t]) <C (Osc(x,[s,t]) + 5), 

(140) Osc(y,[a,t]) <C (Osc(x,[s,t]) + <J). 

Proof. For w,x,y satisfying (i)-(iii) above, let w = (ABA')~ 1 w, x = 
(ABA')~ 1 x, y = y and R = (ABA')" 1 . Since ABA' is a bijection from R-[ 
onto Wi, the matrix with columns given by the vectors {7 J ,j G J} is the 
J x J identity matrix and for z G Wi, the distance d(z, Wf ) = n J • for 
each j G J [where n J is the vector given by the jth row of (ABA')" 1 ], it 
follows that w,x,y satisfy 

(i) w(u) = x(u) + Ry(u) for all u G [s, t] , 

(ii) w(u) G R+ for all u G [s,t], 
(hi) for each j G J, 

(a) &(*)>(); 

(b) yj is nondecreasing; 

(c) yj (u) = yj (s) + J (S)U] l^.^^i^i} dyj(«) for all u G [s,t]. 

Since ii is completely-,?, by Lemma 7.1, and w,x,y satisfy (i)-(iii) above, 
it is immediate from Theorem 5.1 of [43] that there is a constant c\ > 
depending only on R such that 

(141) Osc(u), [s, i]) < ci Osc(ir, [s, t]) + 6 max |n J | J , 

(142) Osc(y, [s,t]) < ci(Osc(£, [s, t]) + 5 max \n j \) . 

Applying the reverse linear transformation ABA' and making Co sufficiently 
large to absorb the factor maxjgj \n J \, it follows that (139)-(140) hold where 
the constant Cq can be chosen to depend only on ABA' (and its inverse). 
□ 

Proof of Lemma 7.5. Fix T > and 5 > 0. By the convergence as- 
sumed for the initial random variables {W r (0),r > 0}, the C-tightness of 
{W^ r (0) + X r (-),r > 0} established in Lemma 7.4, the multiplicative state 
space collapse established in Theorem 5.1 and the fact that (M r ) _1 — > M _1 
as r — > oo, we have that for each e > 0, there are constants Kq > 1 (not 
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depending on e) and ro(e) > such that for all r > ro(e), 



(143) P(\W r (0)\ < Kq, \\X r (-)\\ T <K )>1- 5 -, 

(144) P(\\N r (-) - A(W r (.))\\ T < e(\\N r (-)\\ T V 1)) > 1 - 
and 

(145) ||(M r )" 1 - M~ l \\ < e. 



The constants Kq and ro(e) will depend on T and 5 as well, but since these 
parameters are fixed throughout this proof, we do not explicitly indicate 
that dependence here. 

In the following, e > will be fixed. A specific, suitably small value of e 
will be chosen later (as a function of 5, T) to ensure that various inequalities 
hold [see (158)]. For r > 0, let 

O r * = {\W r (0)\<K ,\\X r (-)\\ T <K , 

||^(.)-A(^(.))||T< e (||iV'-(.)||TVl)}. 

From (143)-(144), we have that for all r > ro(e), 

(146) P(O r ' £ ) > 1 - 5. 

Now, for r > r (e), on O r ' e , by (130)-(132) and (145), we have 

(147) ||f(.)||T<C 3e ||(iV r (.)||rVl), 
where C 3 = ||^4M -1 || + \\A\\ V 1 and 

(148) t (i) = W r (t) - W r (t) for all t > 0. 

Since, by Proposition 4.2, A(W r (t,uj)) is an invariant state, it follows from 
the characterization of invariant states in Theorem 4.1 that for r > ro(e), 
t e [0,T] and u € O r ' £ , there exists q r (t, u) G R J + such that 

(149) A(W r (t,oj)) =diag(p)diag(K)- 1 AV(t,u;) 
and so by the definitions of W r and the matrix B, we have 

(150) W r (t,oj) = ABA'q r (t,Lu). 

We now turn to the behavior of U r . By (39), for each r > ro(e), t € [0,T] 
and j € JJ, 

(151) t)T(t) =r [\cj - (AA(N r (s)))j)ds. 

Jo 

We are interested in where UJ increases. Fix r > ro(e), J G J and w G O r ' e . 
Since the integrand in (151) is nonnegative, we concentrate on determining 
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where the integrand is strictly positive. At an instant s G [0,T] such that 
Cj - (AA(N r (s,iu)))j > 0, by Proposition 2.1 (iv), there is p r (s,uj) G such 
that Pj(s,u;) = and 

(152) iV;X S ^)=A t (^Xg^))( EfcgJ ^ (S '^ ) foralHGl 

By the local traffic Assumption 5.1, there exists an index ij G I such that 
Ajij > and = for all k j^j. Using the fact that pj(s,uj) = in (152), 

it follows that N[.(s,u)) = 0. Then, since to G O r ' £ , we have 

(153) (A(Tr-( S ,u,)));. < e (||7V'-(-,u/)|| T Vl) 
and so, by (149) and the local traffic condition, 

(154) g;( S , W )<e(||Ar(., w )|| T vl)prV 

Thus, letting re- 7 denote the vector given by the jth row of (ABA') -1 (see 
the proof of Lemma 7.2), using (150), we have that 

d(W r (s,uj), W{) =n j ■ W r (s,uj)/\n J \ 

= n j ■ ABA'q r (s,u)/\n j \ 

= <fj(s,^)/\n J \ 

<e(||iV'-(-,a;)|| T Vl)pr 1 S ./|^|. 

Let C4 = maxjgjj Ki.j\n?\. It follows from the reasoning above that for 
each r > ro(s), j G J, on O r ' £ , we have, for all t G [0, T], that 

(155) Uj{t)=J^ 1 { d (Wr(s),VV J 1 )<C4e(\\N r (-)\\TVl)} ^T( s )' 

that is, UJ(-) can increase only when d{W r {-),W{) < C 4 e{\\N r (-)\\ T V 1). 
Furthermore, 

w r (t) = w r (t) -c(t) 

= w r (o)+x r (t)-i r (t) + u r (t), 

where W r (t) G Wi- It then follows from the oscillation inequality in Propo- 
sition 7.1 that for r > ro(e), on O r,£ , 

Osc(W r ,[0,T])<C (O S c(W r (0)+X r (.)-t(-),[0,T}) 

+ C 4 e(\\N r (-)\\ T Vl)) 

(156) 

< C (2K + (2C 3 + C^£(\\N r (-)\\T V 1)) 

< <%(2K + (2C 3 + (%)eC 2 (||Vr(-)llT V 1)), 
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where we have used the definition of O r ' £ and (147) for the second inequal- 
ity and we have used Lemma 7.3 plus the fact that C 2 > 1 for the third 
inequality. Similarly, we obtain an oscillation bound for U r for r > ro(e) on 

QT,e. 

(157) Osc(l> r , [0,T]) < C (2K + (2C 3 + C 4 )eC 2 (\\W r (-)\\ T V 1)). 

On combining (156) with (148), (147) and Lemma 7.3, we have that for 
f > ^o(^)> on O r,£ , 

\\W r (-)\\ T V 1 < \W r (0)\ V 1 + Osc(VT(-), [0,T]) 

< |^ r (0)| V 1 + Osc{W r (-), [0,T]) + Osc(| r (-), [0,21) 

< Kq + (Co + i)(2K + (2C 3 + C A )eC 2 (\\W r (-)\\ T V 1)). 

Now, choose 

/ N f 1 8 
(158 e = min — s s s — -, s s s s— 

V2(C + 1)(2C 3 + C 4 )C 2 2K C7 2 (3 + 2C )(C 3 + C 4 ) 
Then, from the above, we conclude that for r > ro(e), on O r,£ , 

(159) ||W r (-)||TVl<2/^o(3 + 2C'o) 
and, using the fact that C3 > 1, we have 

e(\\N r (.)\\ T Vl)<eC 2 (\\W r (-)\\ T Vl) 

(160) < - 



C3 + C4 
<<5. 



Further, by (157), (159) and the definition of e, we have 
(161) Osc(L> r ,[0,T]) <2C (^o + <5). 

Note that since U r is nondecreasing and starts from zero, the above also 
provides a bound for ||?7 r (-)||r- Now, by (160), for r > ro(e), on O r ' £ , we 
have 

\\N r (-)-A(W r (-))\\ T <e(\\N r (-)\\ T Vl) 
<8, 

using (147), we have 

||r(-)||T<C3£(||^(-)||TVl) 

<8, 
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using (155), we have, for each j € JJ and t € [0, T] 



(162) 



f 

Jo 



1 



{d(w r (s,u),w{)<8} dU r j{s) 



and so by (134)-(135) 



(163) 



IIC r ' 5 (-)llT = o. 



Now, set K(T, 5) = max(2K (3 + 2C ), 2C (K + 5)) and r(T, 5) = r (e), 
where e > is given by (158). Then, for all r > r(T,5), by the above and 
(146), we have 



P(||iV^-)-A(^(.))||T<5,||| r (-)l|T<<5, 
(164) \\C' S i)\\T = 0, ||# r (-)llT < K(T,6), \\U r (-)\\ T < K(T,S)) 



Proof of Theorem 5.2. Assume that the hypotheses of Theorem 5.2 
hold. We will show that the conditions of Theorem 5.4 of [20] hold, from 
which it will follow immediately that W r converges in distribution as r — > oo 
to an SRBM with data (Wi, 9, T, : j E J}, rj). The joint convergence of N r 
with W r will then follow by the state space collapse established in Lemma 



The conditions of Theorem 5.4 fall into four groups. We treat each of 
these groups separately below. 

First, the diffusion state space Wi must be a convex polyhedron having 
nonempty interior described as the intersection of a minimal set of half- 
spaces and the directions of reflection {7-? :j G J} must satisfy Assumption 
5.1 of [20]. These properties were verified in the proof of Lemma 7.2. 

Second, one must verify that Assumption 4.1 of [20] holds. This condition 
amounts to verifying that W r ,X r ,U r satisfy conditions similar to (i), (ii), 
(iv) on W,X,U in the definition of an SRBM, except that the prelimit 
processes are allowed to have r.c.1.1. rather than continuous paths and small 
perturbations in the conditions (i), (ii) and (iv) are allowed. Furthermore, 
the sequence {Vl^ r (0) + X r (-)} is required to be C-tight. The latter follows 
from Lemma 7.4 and the former follows from the properties in (129)-(135), 
once we show that £ r — ► and <f r ' 5r — > in probability as r — > 00 for a suitable 
sequence {5 r } satisfying 5 r — ► as r — > 00. We verify the latter properties in 
the next paragraph. 

Recall the constants K(T, 5) and r(T, 5) from Lemma 7.5. Choose strictly 
increasing sequences of positive constants {K^^k > 1} and {r^^k > 1} such 
that for each k, > K(k, ^) and > r(k, 4), and — > 00 as k — > 00. Define 



> P(O r ' e ) >l-8. 



□ 



7.5. 
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5 r such that 5 r = 1 when r < n and <5 r = I when r £ (rfc,rfc +1 ] for k > 1. 
Then, 5 r — > as r — > oo and, by Lemma 7.5, for each A; > 1, for r& < r < r^+i, 

p(||A> r (-)-A(^(-))lk-<pllf(-)lk-<p 



(165) 

l|C r ^(-)IU = 0, ||W r (-)lk < l|f> r (-)IU < #fc) >i-\- 



It follows from this that £ r — > and C r ' 5r — > in probability as r — > oo. 
The conditions of Assumption 4.1 in [20] are then satisfied with Wi in 
place of G, r in place of n, j in place of i, W r in place of W n , W r in 
place of W n , W r (0) + X r in place of X n , U r in place of Y n , Yf(t) = 

Third, one must show that {VF r (0) +X r (-)} converges in distribution as 
r — > oo to a Brownian motion with drift 8, covariance matrix T given by 
(46) and initial distribution rj. [This amounts to verifying condition (vi) of 
Theorem 4.3 in [20] (with 8 in place of \x and rj in place of v there). This 
condition is needed so that weak limit points of {(W r ,X r ,U r )} will satisfy 
property (iii)(a) and the last part of (ii) in the Definition 5.2 of an SRBM.] 
To verify this condition, we first show that {T r (-)} converges in distribution 
to the deterministic process {p(t),t > 0}, where p(t) = pt. Indeed, by the 
fact that T r {-) is uniformly Lipschitz continuous with a Lipschitz constant 
which does not depend on r, we have that {T r (-),r > 0} is C-tight. Let T*(-) 
be a weak limit point of this sequence, obtained as a limit in distribution 
along a suitable subsequence. We will prove below that, almost surely, 

(166) T*(t)=pt foralH>0, 

from which the desired convergence in distribution of {T r (-)} to T*(-) fol- 
lows. 

For the proof of (166), by passing to a subsequence, we may assume that 
{T r (-)} converges in distribution to T*(-) as r — > oo. Now, by (16) and (5), 
on dividing by r, we have, for each i € I, 

NV(t) N r (0) = = = 

(167) = +E r (t) - S r(TT(t)), 
r r 

where 

From (165), we can conclude, using the continuity of A(-), that 

N r (-) 

=> as r — > oo 
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and, from (21), it follows that 

(#(0,^(0)^K-),M(-)) asr^oo, 

where u(t) = vt and p(t) = pt for all t > 0. Thus, on letting r — > oo in (167), 
we obtain that, almost surely, for each i G I, 

= Ui t- fiiT* (t) for all f > 0. 

The desired result (166) follows immediately. 

It now follows from (21) and the assumption on the convergence in distri- 
bution of W r (0) made at the end of Section 3 that {W r {0) , E r (■) , S r (-),T r (-)) 
converges in distribution to (W (0) , E (•) , S (•) , p(-)) as r — ► oo, where W(0) 
is independent of the Brownian motion (E (•),£>(•)) and W(0) has distribu- 
tion r] on Wi. On combining this with a random time change theorem and 
Assumption 3.1, we conclude, using (128), that {VP*(0) + X r (-),r > 0} con- 
verges in distribution to a Brownian motion with drift 0, covariance matrix 
r given by (46) and initial distribution r/, as desired. 

Fourth, and finally, we must verify condition (vii) of Theorem 4.3 of [20] 
(with 6 in place of p there). This condition requires that for any weak limit 
point (W,X,U) of (W r ,X r ,U r ), {X(t) - 8t,t> 0} is a martingale relative 
to the filtration generated by (W,X,U). [This condition is needed so that 
property (iii)(b) in Definition 5.2 of an SRBM will be satisfied by weak limit 
points of {(W r ,X r , U r )}.\ Let 8 r = r(Ap r - C). Then, by Proposition 4.1 of 
[20], to show that this final condition holds, it suffices to verify the following 
properties for the prelimit processes: 

(a) for each r, {X r (t) — 8 r t,t > 0} is a martingale with respect to the 
filtration generated by (W r ,X r ,U r ); 

(b) 9 r — > 6 as r — > oo; 

(c) {X r (t),r > 0} is uniformly integrable for each fixed t > 0. 

We now verify these three properties. 

It is well known (see Theorem 4.1, Chapter 6 of [12]) that for the continuous- 
time Markov chain N r , for each r > and % € I, 

Xl(t) = NT(t) - JVT(0) - \\v\ - ^HN r ( S ))) ds, t > 0, 

J o 

defines a martingale with respect to the filtration generated by iV 7 *, that is, 
with respect to {^J, t > 0}, where Tl = a{N r (s) : < s < t}. Setting 

v r (r 2 t) 

r(t) = xK_2^ r t =r r , v 

we have that for each r > 0, {x r (t),J 7 [,t > 0} is a multidimensional martin- 
gale. Now, using the expressions (17) and (36) for W r , and the expression 
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(39) for U r , we see that 

A(M r )~ 1 x r (t) = W r (t) - W r (0) - Ap r rt + Crt - U r (t) 

= X r {t)+r(C - Ap r )t. 

Since W r and U r are adapted to the filtration {J-J,t > 0}, so is X r , and it 
then follows that 

X r (t)-9 r t = A(M r )- 1 X r (t), t>0, 

is a martingale with respect to the filtration generated by (W r ,X r ,U r ). 
By (12), 6 r — > 9 as r — > oo. Finally, since the mean of a Poisson random 
variable is the same as its variance and the rate of increase of T r is uni- 
formly bounded, it is straightforward to verify, using Assumption 3.1, that 
{X r (t),r > 0} is uniformly integrable for each t > 0. Thus, we have verified 
conditions (a)-(c) above and so condition (vii) of Theorem 4.3 in [20] holds. 

We have now verified all of the hypotheses of Theorem 5.4 of [20] and 
so it follows that W r converges in distribution as r — > oo to an SRBM W 
associated with the data (Wi, 9, T, {j 3 ' : j G 3},rj). From (165), we have that 
N r — A(W r ) converges in distribution to the zero process as r — > oo. Since 
A(-) is continuous, it follows that we have the joint convergence of (W r ,N r ) 
in distribution as r — > oo to (W, A(W)). □ 

8. Conclusion. A bandwidth sharing policy corresponds to a generaliza- 
tion of the notion of a processor sharing discipline from a single resource to a 
network with several shared resources. In particular, weighted a-fair policies 
provide a tractable theoretical abstraction of the bandwidth sharing effected 
by decentralized packet-based end-to-end congestion control algorithms such 
as TCP. It is known [3, 11] that, for flow-level models with exponentially 
distributed file sizes, weighted a-fair policies are stable when the average 
load on each resource is less than its capacity. 

Weighted a-fair policies can nevertheless suffer from entrainment of re- 
sources, whereby congestion at some resources may prevent others from 
working at full capacity: this is manifested under diffusion scaling, where a 
Brownian model for the workload process lives in a cone which may be a 
strict subset of the positive orthant. 

Under weighted proportional fair sharing (a = 1) and a mild local traffic 
condition, this paper has shown how multiplicative state space collapse can 
be combined with an invariance principle to establish a diffusion approxima- 
tion for the flow-level model. For proportional fair sharing (equal weights), 
this diffusion has a product form invariant measure which, when integrable, 
can be normalized to yield the unique stationary distribution for the diffu- 
sion. This result extends to the case where file sizes are distributed as finite 
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mixtures of exponentials. Thus, in the diffusion limit, more networks can 
have product form stationary distributions than are known in the prelimit 
(see Bonald and Proutiere [4]). We also indicated some open problems for 
and an extension to models with routing. 
Bandwidth sharing policies outside the class of weighted a-fair policies 
may avoid entrainment, although such policies may not be easy to effect via 
decentralized packet-based end-to-end congestion control algorithms. 
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